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ALLELE-SPECmC siRNA-MEDIATED GENE SILENCING 

Claim of Priority 
This is a continuation-in-part of application U.S. Application Serial No. 
10/430,351 ffled on May 5, 2003, which is a continuation of U.S. Application 
Serial No. 10/322,086 filed on December 17, 2002, which is a continuation-in- 
part ^plication of U.S. Application Serial No. 1 0/212,322, filed August 5, 2002. 

Statement Regarding Federally Sponsored Research Or Development 
Work relating to this application was supported by grants from the 
National Institutes of Health (NS044494 and NS38712). The government may 
have certain rights in the inventioa 



. Background of the Invention 
Double-stranded RNA (dsRNA) can induce sequence^specific 
posttranscriptional gene silencing in many organisms by ^.process known as 
RNA interference (RNAi). However, in mammalian cells, dsRNA that is 30 

20 base pairs or longer can induce sequence-nonspecific responses that trigger a 
shut-down of protein synthesis. Recent work suggests that RNA firagments are 
the sequence-specific mediators of RNAi (Elbashir et ai, 2001). Interference of 
gene expression by these small interfering RNA (siRNA) is now recognized as a 
naturally occurring strategy for silencing genes in C elegans, Drosophila, 

25 plants, and in mouse embryonic stem cells, oocj^es and early embryos (Cogoni 
et ai, 1994; Baulcombe, 1996; Kennerdell, 1998; Timmons, 1998; Waterhouse 
et ai, 1998; Wianny and Zemicka-Goetz, 2000; Yang etaL, 2001; Svoboda et 
al, 2000). In mammalian cell culture, a siRNA-mediated reduction in gene 
expression has been accomplished only by transfecting cells with synthetic 

30 RNA oligonucleotides (Caplan et ai, 2001; Elbashir et al, 2001). 
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Suromary of the Inventioii 
The present invention provides a mammaUan cell containing an isolated 
first strand of KNA of 15 to 30 nucleotides in length, and an isolated second 
strand of RNA of 15 to 30 nucleotides in length, wherein the first strand contains 
5 a sequence that is conq)lementary to at least 15 contiguous nucleotides of a 
targeted gene of interest, wherein at least 12 nucleotides of the first and second 
strands are complementary to each other and form a small interfering RNA 
(siRNA) duplex under physiological conditions, and wherein the siRNA silences 
only one allele of the targeted gene in the cell. The duplex of the siRNA may be 

10 between 15 and 25 base pairs in length. The two strands of RNA in the siRNA 
may be completely complementary, or one or the other of the strands may have 
an "overiiang region" (i.e., a portion of the RNA that does not bind with the 
second strand). These overhangs may be at the 3* end or at the 5' overhang 
region, or at both 3' and 5' ends. Such overhang regions may be jfrom 1 to 10 

15 nucleotides in length. In the present invention, the first and second strand of 
RNA may be operably Unked together by means of an RNA loop strand to form 
a hairpin structure to form a "duplex stmcture" and a "loop structure." These 
loop structures may be firom 4 to 10 nucleotides in length. For example, the loop 
structure may be 4, 5 or 6 nucleotides long. 

20 The present invention also provides a mammalian cell that contains an 

expression cassette encoding an isolated first strand of RNA of 15 to 30 
nucleotides in length, and an isolated second strand of RNA of 15 to 30 
nucleotides in length, wherein the first strand contains a sequence that is 
complementary to at least 15 contiguous nucleotides of a targeted gene of 

25 interest, wherein at least 12 nucleotides of the first and second strands are 

complementary to each other and form a small interfering RNA (siRNA) duplex 
imder physiological conditions, and wherein the siRNA silences only one allele 
of the targeted gene in the cell. These expression cassettes may further contain a 
promoter. Such promoters can be regulatable promoters or constitutive 

30 promoters. Examples of suitable promoters include a CMV, RSV, pol n or pel 
in promoter. The expression cassette may finiher contain a polyadenylation 
signal, such as a synthetic minimal polyadenylation signal The expression 
cassette may fiurther contain a marker gene. The expression cassette may be 
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contained in a vector. Examples of appropriate vectors include adenoviral, 
lentiviial, adeno-associated viral (AAV), poliovirus, HS V. or murinfi Maloney- 
based viral vectors. Iq one anbodiment, the vector is an adenoviral vector. 
In the present invention, the alleles of the tar^ted gene may differ by 
5 seven or fewer base pairs out of 21 base pairs (e.g., 7, 6, 5, 4, 3, 2 or 1 base 
pairs). They may even differ by only one base pair out of 21 base pairs. 
Examples of targeted gene transcripts include transcripts encoding a beta- 
glucmonidase, TorsinA, Ataxin-3, Tau, or huntingtin. The targeted genes and 
gene products (i.e., a transcript or protein) may be from different species of 
10 organisms, such as a mouse allele or a human aUele of a target gene. 

The present invention also provides an isolated RNA duplex containing a 
first strand of RNA and a second strand of KNA, wherein the first stnind 
contains at least 15 contiguous nucleotides complementary to mutant TorsinA 
encoded by SEQ ID NO:55, and wherein the second strand is complementary to 
15 at least 12 contiguous nucleotides of the first strand. In one embodiment of the 
invention (mutA-si), the first strand of RNA is encoded by SEQ ID NO:49 and 
the second strand of RNA is encoded by SEQ ID NO:50. hi an alternative 
embodiment (mutB-si), the first strand of KNA is encoded by SEQ ID N0:5I 
and the second strand of RNA is encoded by SEQ ID NO:52. In another 
20 embodiment (mutC-si), the first strand of RNA is encoded by SEQ ID NO:53 

and second strand of RNA is aicoded by SEQ ID NO:54. As used herein the ' 
temi "encoded by" means that the DNA sequence in the SEQ ID NO is 
transcribed into the RNA of interest. This term is used in a broad sense, similar 
to the term "comprising" in patent temimology. For example, the statement "the 
25 first strand of RNA is encoded by SEQ ID NO:49" means that the first strand of 
RNA sequence coiresponds to the DNA sequence indicated in SEQ ID NO:49, 
but may also contain additional nucleotides at either the 3' end or at the 5' end of 
the RNA molecule. 

The present invention finlher provides an RNA duplex containing a first 
30 strand of RNA and a second strand of RNA, wherein the first strand contains at 
least 15 contiguous nucleotides complementary to mutant Ataxin-3 transcript 
encoded by SEQ ID N0:8, and wherein the second strand is complementary to at 
least 12 contiguous nucleotides of flie fiist strand. In one embodiment (siC7/8), 
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the first strand of RNA is encoded by SEQ ID NO: 1 9 and the second strand of 
KNA is encoded by SEQ ID NO: 20. In another embodiment (siClO), the first 
strand of RNA is encoded by SEQ ID N0:21 and the second strand of RNA is 
encoded by SEQ BD NO:22. 
5 The present invention further provides an RNA duplex containing a first 

strand of RNA and a second strand of RNA, wherein the first strand contains at 
least 15 contiguous nucleotides complementary to mutant Tau transcript encoded 
by SEQ ID NO:39 (siA9/C12), and wherein the second strand is complementary 
to at least 12 contiguous nucleotides of the first strand. The second strand may 
10 be encoded by SEQ ID NO:40. 

The RNA d\q)lexes of the present invention are between 15 and 30 base 
pairs in length. For example they may be between 19 and 25 base pairs in 
length. As discussed above the first and/or second strand further comprises an 
overhang region. These overhangs may be at the 3' end or at the 5' overhang 
15 region, or at both 3* and 5* ends. Such overhang regions may be from 1 to 10 
nucleotides m length. In the present invention, the first and second strand of 
RNA may be operably linked togeflier by means ofan RNA loop strand to fonn 
a hairpin structure to form a "duplex structure" and a "loop structure." These 
loop structures may be fi-om 4 to 10 nucleotides in lengtii. For example, tiie loop 
20 structure may be 4, 5 or 6 nucleotides long. 

In the present mvention, an expression cassette may contain a nucleic 
acid encoding at least one strand of the RNA duplex described above. Such an 
expression cassette may finther contain a promoter. The expression cassette 
may be contained in a vector. These cassettes and vectors may be contained in a 
25 cell, such as a mammalian cell. A non-human mammal may contain the cassette 
or vector. The vector may contain two expression cassettes, tiie first expression 
cassette contauiing a nucleic acid encoding the first strand of the RNA duplex, 
and a second expression cassette containing a nucleic acid encoding the second 
strand of the RNA duplex. 
30 The present invention further provides a method of performing allele- 

specific gene silencing in a mammal by administering to tiie mammal an isolated 
first strand of RNA of 15 to 30 nucleotides in lengtii, and an isolated second 
strand of RNA of 15 to 30 nucleotides in length, wherem the first strand contains 
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at least 15 contiguous nucleotides complementary to a targeted gene of intraest, 
wherein at least 12 nucleotides of the first and second strands ar« complementary 
to each other and form a smaU interfering RNA (siRNA) duplex under 
physiological conditions, and wherein the siRNA silences only one allele of the 
5 targeted gene in the mammal. The aUelesofthe gene may differ by seven or 
fewer base pairs out of 21 base pairs, such as by only one base pair. In one 
escample, the gene is a beta-glucuronidase gene. The alleles may be murine- 
specific and human-specific alleles of beta-glucuxonidse. Examples of gene 
transcripts include an RNA transcript complementary to TorsinA, Ataxm-3, 
10 hmtingtin or Tau. The targeted gene may be a gene associated with a condition 
amenable to siRNA therapy. For example, the condition amenable to siRNA 
therapy could be a disabling neurological disorder. •'Neuixjlogical disease" and 
"neurological disorder" refer to both hereditary and sporadic conditions that are 
characterized by nervous system dysfimction, and which may be associated witii 
15 atrophy of the affected central or peripheral nervous system structures, or loss of 
• function without atrophy A neurological disease or disorder that results in 
atrophy is commonly called a "neurodegenerative disease" or 
•'neurodegenerative disorder." Neurodegenerative diseases and disorders 
include, but are not limited to, amyotrophic lateral sclerosis (ALS), hereditary 
20 spastic hemiplegia, primary lateral sclerosis, spinal muscular atrophy, Kennedy's 
disease, Alzheuner's disease, Paridnson's disease, multiple sclerosis, and repeat 
expansion neurodegenerative diseases, diseases associated with expansions 
of trinucleotide rq)eate such as polygjutamine ^lyQ) r^eat diseases, e.g., 
Huntington's disease (HD), spinocerebellar ataxia (SCAl, SCA2, SCA3, SCA6, 
25 SCA7, and SCA17), spinal and bulbar muscular atrophy (SBMA), 

dentatorubropalUdoluysian atrophy (DRPLA). An example of a disabUng 
neurological disorder that does not appear to result in atrophy is DYTl dystonia. 
The gene of interest may encode a ligand for a chemokine involved in the 
migration of a cancer cell, or a chemokine receptor. 
30 The present invention fiirther provides a method of substantiaUy the 

silencing target aUele while allowing substantiaUy continued expression of a 
wild-type aUele by conferring on the ceU the abiUty to express siRNA as an 
expression cassette, wherein the ejq)ression cassette contains a nucleic acid 
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encodmg a small interfering RNA molecule (siRNA) targeted against a target 
allele, wherein expression from the targeted allele is substantially silenced but 
wherein expression of the wild-type allele is not substantially silenced. 

The present invention provides a method of treating dominantly inherited 
5 disease in an allele-specific manner by administering to a patient in need thereof 
an expression cassette, wherein the expression cassette contains a nucleic acid 
encoding a small interfering RNA molecule (siRNA) targeted against a target 
allele, wherein expression from flie target allele is substantially silenced but 
wherem expression of the wild-type allele is not substantially silenced. 

1 0 The present invention also provides a method of performing allele- 

specific gene silencing by administering an expression cassette containing a pol 
n promoter operably-linked to a nucleic acid encoding at least one strand of a 
small mterfering RNA molecule (siRNA) targeted against a gene of interest, 
wherein the siRNA silences only one allele of a gene. 

15 The present invention provides a method of performing allele-specific 

gene silencing in a mammal by administering to the mammal a vector containing 
an expression cassette, wherein the expression cassette contains a nucleic acid 
encoding at least one strand of a small interfering RNA molecule (siRNA) 
targeted against a gene of interest, wherein the siRNA silences only one allele of 

20 a gene. 

Moreover, the present invention provides a method of screening of allele- 
specific siRNA duplexes involving contacting a cell containing a predetennined 
mutant allele with an siRNA with a known sequence, contacting a cell 
containing a wild-type allele with an siRNA with a known sequence, and 

25 determining if the mutant allele is substantially silenced while the wild-type 
allele retains substantially normal activity. 

The present invention also provides a method of screening of allele- 
specific siRNA duplexes involving contacting a cell containing a predetennined 
mutant allele and a wild-type allele with an siRNA with a known sequence, and 

30 determining if the mutant allele is substantially silenced while the wild-type 
allele retains substantially normal activity. 
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Also provided is a method for detennining the function of an allele by 
contacting a cell containing a predetermined allele with an siKNA with a known 
sequence, and determining if the function of the allele is substantially modified. 

The present invention further provides a method for detennining the 
5 function of an allele by contacting a cell containing a predetermined mutant 
allele and a wild-type allele with an siRNA with a known sequence, and 
detennining if the function of the allele is substantially modified while the wild- 
type allele retains substantially normal function. 

10 

Brief Description of the Figures 
This patent or application file contains at least one drawing executed in 
color. Copies of this patent or patent application publication with color 
drawing(s) will be provided by the Office upon request and payment of the 

15 necessary fee. 

Figure 1 . siRNA expressed from CMV promoter constructs and in vitro 
effects. (A) A cartoon of the expression plasmid used for expression of 
functional siRNA in cells. The CMV promoter was modified to allow close 
juxtaposition of the haiipin to the transcription initiation site, and a minimal 

20 polyadenylation signal containmg cassette was constructed immediately 3 ' of the 
MCS (mCMV, modified CMV; mpA, minipA). (B, C) Fluorescence 
photomicrographs of HBK293 cells 72 h after transfection of pEGFPNl and 
pCMVpgal (control), or pEGFPNl and pmCMVsiGFPmpA, respectively. (D) 
Northern blot evaluation of transcripts harvested from pmCMVsiGFPmpA 

25 (lanes 3, 4) and pmCMVsiBgalmpA (lane 2) transfected HEK293 cells. Blots 
were probed with ^^P-labeled sense oligonucleotides, Antisense probes yielded 
similar results (not shown). Lane 1, ^^P-labeled RNA markers. AdsiGFP 
infected cells also possessed appropriately sized transcripts (not shown). (E) 
Northern blot for evaluation of target mRNA reduction by siRNA (upper panel), 

30 The internal control GAPDH is shown in the lower panel. HEK293 cells were 
transfected with pEGFPNl and pmCMVsiGFPmpA, expressing siGFP, or 
plasmids expressing the control siRNA as indicated. pCMVeGFPx, which 
expresses siGFPx, contains a large poly(A) cassette from SV40 large T and an 
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unmodified CMV promoter, in contrast to pmCMVsiGFPmpA shoivn in (A). 
(F) Western blot with anti-GFP antibodies of cell lysates harvested 72 h after 
transfection withpEGFPNl and pCMVsiGFPmpA, orpEGFPNl and 
pmCMVsipglucmpA. (G, H) Fluorescence photomicrographs of HEK293 cells 
5 72 h after transfection of pEGFPNl and pCMVsiGFPx, or pEGFPNl and 
pmCMVsiBglucmpA, respectively. (I, J) siRNA reduces expression from 
endogenous alleles. Recombmant adenoviruses were generated from 
pmCMVsiPglucmpA and pmCMVsiGFPmpA and purified. HeLa cells were 
infected with 25 infectious viruses/ceU (MOT = 25) or mock-infected (control) 
10 and cell lysates harvested 72 h later. (I) Northern blot for 6-gIucuronidase 

mKNA levels in Adsifigluc and AdsiGFP transduced cells. GAPDH was used as 
an internal control for loading. (J) The concentration of p-glucuronidase activity 
in lysates quantified by a fluorometric assay. Stein,' C.S. et al., J. Virol. 73:3424- 
3429 (1999), 

1 5 Figure 2. Viral vectors expressing siRNA reduce expression from 

transgenic and endogenous alleles in vivo. Recombinant adenovirus vectors 
were prepared from the siGFP and sipgluc shuttle plasmids described in Fig. 1. 
(A) Fluorescence microscopy reveals diminution of eGFP expression in vivo. Jn 
addition to the siRNA sequences in the El region of adenovirus, RFP e3q)ression 
20 cassettes in E3 faciUtate localization of gene transfer. Representative 

photomicrographs of eGFP (left), RFP (middle), and merged images (right) of 
coronal sections from mice injected with adenoviruses expressing siGFP (top 
panels) or siPgluc (bottom panels) demonstrate siRNA specificity in eGFP 
transgenic mice striata after direct brain injection. (B) Full coronal brain 
25 sections (1 mm) harvested from AdsiGFP or Adsipgluc injected mice were split 
into hemisections and both ipsilateral (il) and contralateral (cl) portions 
evaluated by western blot using antibodies to GFP. Actin was used as an 
internal control for each sample. (C) Tail vein injection of recombinant 
adenoviruses expressing sipgluc directed against mouse P-glucuronidase 
30 (AdsiMuPgluc) reduces endogenous p-glucuronidase RNA as determined by 
Northem blot m contrast to control-treated (AdsiBgal) mice. 

Figure 3. siGFP gene transfer reduces Q19-eGFP expression in cell 
lines. PC12 cells expressing the polyglutamine repeat Q19 fiised to eGFP 
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(eGFP-Q19) under tetracycline repression (A, bottom left) were washed and 
dox-free media added to allow eGFP-Q19 expression (A, top left). 
Adenoviruses were applied at the indicated multiplicity of infection (MOI) 3 
days after dox removal. (A) eGFP fluorescence 3 days after adenovirus- 
5 mediated gene transfer of Adsipgluc (top panels) or AdsiGFP (bottom panels). 
(B, C) Western blot analysis of cell lysates harvested 3 days after infection at the 
indicated MOIs demonstrate a dose-dependent decrease in GFP-Q19 protein 
levels. NV, no virus. Top lanes, eGFP-Q19. Bottom lanes, actin loading 
controls. (D) Quantitation of eGFP fluorescence. Data represent mean total area 

1 0 fluorescence ± standard deviation in 4 low power fields/well (3 wells/plate). 

Figure 4. siRNA mediated reduction of expanded polyglutamine protein 
levels and intracellular aggregates. PC12 cells expressing tet-repressible eGFP- 
Q80 fiision proteins were washed to remove doxycycline and adenovirus vectors 
expressing siRNA were applied 3 days later. (A-D) Representative punctate 

15 eGFP fluorescence of aggregates ia mock-infected cells (A), or those infected 
with 100 MOI of Adsipgluc (B), AdsiGFPx (C) or Adsipgal (D). (E) Three days 
after infection of dox-free eGFP-Q80 PC12 cells with AdsiGFP, aggregate size 
and number are notably reduced. (F) Westem blot analysis of eGFP-Q80 
aggregates (arrowhead) and monomer (arrow) following Adsipgluc or AdsiGFP 

20 infection at the indicated MOIs demonstrates dose dependent siGFP-mediated 
reduction of GFP-Q80 protein levels. (G) Quantification of the total area of 
fluorescent inclusions measured in 4 independent fields/well 3 days after virus 
was applied at the indicated MOIs. The data are mean ± standard deviation. 
Figure 5. RNAi-mediated suppression of expanded CAG repeat containing 

25 genes. Expanded CAG repeats are not direct targets for preferential inactivation 

(A) , but a linked SNP can be exploited to generate siRNA that selectively 
silences mutant ataxin-3 expression (B-F). (A) Schematic of cDNA encodmg 
generaUzed polyQ-fluorescent protein ftisions. Bars indicate regions targeted by 
siRNAs. HeLa cells co-transfected with Q80-GFP, Q19-REP and the indicated 

30 siRNA Nuclei are visuaUzed by DAPI staining (blue) in merged images. 

(B) Schematic of human ataxin-3 cDNA with bars indicating regions targeted by 
siRNAs. The targeted SNP (G987C) is shown in color. In the displayed siRNAs, 



9 



wo 2004/013280 



PCT/US2003/016887 



red or blue bars denote C or G respectively. In fliis Figure, 
AGCACSCAGCAGGGGGACCTATCAGGAC is SEQ ID N0:7. and 
CAGCAGCAGCAGCGGGACCTATCAGGAC is SEQ ID NO:8. (C) 
Quantitation of fluorescence in Cos-7 cells transfected with wild ^e or mutant 
5 ataxin-3-GFP expression plasmids and the indicated siRNA. Fluorescence from 
cells co-transfected with siMiss was set at one. Bars depict mean total 
fluorescence 6om three independent experiments +/- standard error of the mean 
(SBNQ. (D) Western blot analysis of cells co-transfected with the indicated 
ataxin-3 expression plasmids (top) and siRNAs (bottom). Appearance of 
10 aggregated, mutant ataxin-3 in the stacking gel (seen with siMiss and siGlO) is 
prevented by siRNA mhibition of the mutant aUele. (E) Allele specificity is 
retained in the simulated heterozygous state. Western blot analysis of Cos-7 cells 
cotransfected with wild-type (atx-3-Q28-GEP) and mutant (atx-Q166) 
expression plasmids along with the indicated siRNAs. (Mutant ataxin-3 detected 
15 with 1C2, an antibody specific for expanded polyQ, and wild-type ataxin-3 

detected with anti-ataxin-3 antibody.) (F) Western blot of Cos-7 cells transfected 
with Atx-3-GFP expression plasmids and plasmids encoding the indicated 
shRNA. The negative control plasmid, phU6-LacZi, encodes siRNA specific for 
LacZ. Both normal and mutant protein were detected with anti-ataxin-3 
antibody. Tubulin immunostaining shown as a loading control in panels (D)-(F). 

Figure 6, Primer sequences for in vitro synthesis of siRNAs using T7 
polymerase. All primers contain the followmg T7 promoter sequence at their 3' 
ends: 5'-TATAGTGAGTCGTATTA-3' (SEQ TD N0:9). The foUowing primer 
was annealed to all oligos to synthesize siRNAs: 5'- 
25 TAATACGACTCACTATAG-3' (SEQ ID NO:10). 

Figure 7. hiclusion of either two (siC7/8) or three (siClO) CAG triplets 
at the 5' end of ataxin-3 siRNA does not inhibit expression of unrelated CAG 
repeat containing genes. (A) Western blot analysis of Cos-7 cells transfected 
with CAG repeat-GFP fiision proteins and the indicated siRNA. 
30 Immunostaining with mwioclonal anti-GFP antibody (MBL) at 1 : 1000 dilution. 
(B) Westem blot analysis of Cos-7 cells transfected with Flag-tagged ataxin-1- 
Q30, which is unrelated to ataxin-3, and the indicated siRNA. hnmunostaining 
with anti-Flag monoclonal antibody (Sigma St Louis, MO) at 1:1000 dUution. 
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In panels (A) and (B), lysates were collected 24 hours after transfection. Tubulin 
immunostaining shown as a loading control. 

Figure 8. shRNA-e3q)ressing adenovirus mediates allele-specific 
silencing in transiently transfected Cos-? cells simulating the heterozygous state. 
5 (A) Represetttative images of cells cotransfected to express wild type and 

mutant ataxm-3 and infected with the indicated adenovirus at 50 multiplicities of 
mfection (MOT). Atx-3-Q28-GFP (green) is directly visualized and Atx-3-Q166 
(red) is detected by immunofluorescence with 1C2 antibody. Nuclei visualized 
with DAPI stain in merged images. An average of 73.1% of cells co-expressed 
10 both ataxin-3 proteins with siMiss. (B) Quantitation of mean fluorescence from 2 
independent experiments performed as in (A). (C) Western blot analysis of viral- 
mediated silencing in Cos-7 cells expressing wild type and mutant ataxin-3 as in 
(A). Mutant ataxin-3 detected with 1C2 antibody and wild-type human and 
endogenous primate ataxin-3 detected with anti-atajdn-3 antibody, (D) shRNA- 
1 5 expressing adenovirus mediates allele-specific silencing in stably transfected 
neural cell lines. Differentiated PC12 neural cells e3q)ressing wild type (left) or 
mutant (right) ataxin-3 were infected with adenovirus (100 MOT) engineered to 
express the indicated hairpin siRNA Shown are Western blots immunostained 
for ataxin-3 and GAPDH as loading control. 
20 Figure 9. Allele-specific siRNA suppression of a rrdssense Tau 

mutation. (A) Schematic of human tau cDNA with bars indicating regions and 
mutations tested for siENA suppression. Of these, the V337M region showed 
effective siq)pression and was further studied. Vertical bars represent 
microtubule binding repeat elements in Tau. In the displayed siRNAs, blue and 
25 red bars denote A and C respectively. In this Figure, 

GTGGCCAGATGGAAGTAAAATC is SEQ ID NO:35, and 
GTGGCCAGGTGGAAGTAAAATCisSEQIDN0:41. (B) Western blot 
analysis of cells co-transfected with WT or V337M Tau-EGFP fusion proteins 
and the mdicated siRNAs. Cells were lysed 24 hr after transfection and probed 
30 with anti-tau antibody. Tubulin immunostaining is shown as loading control. (C) 
Quantitation of fluorescence in Cos-7 cells transfected with wild type tau-EGFP 
or mutant V337M tau-EGFP expression plasmids and the indicated siRNAs. 
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Bars depict mean fluorescence and SEM from three independent experiments. 
Fluorescence from cells co-transfected with siMiss was set at one. 

Figure 10. Allele-specific silencing of Tau in cells simulating the 
heterozygous state. (A) Representative fluorescent images of fixed Hela cells co- 
5 transfected with flag-tagged WT-Taa (red), V337M-Tau-GFP (green), and the 
indicated siRNAs. An average of 73.7% of cells co-expressed both Tau proteins 
with siMiss. While siA9 suppresses both alleles, siA9/C12 selectively decreased 
expression of mutant Tau only. Nuclei visualized with DAPI stain in merged 
images. (B) Quantitation of mean fluorescence from 2 independent experiments 

1 0 performed as in (A). (C) Western blot analysis of cells co-transfected with Flag- 
WT-Tau and V337M-Tau-EGFP fusion proteins and the indicated siRNAs. Cells 
were lysed 24 hr after transfection and probed with anti-tau antibody. V337M- 
GFP Tau was differentiated based on reduced electrophoretic mobility due to 
the addition of GFP. Tubulin immunostaining is shown as a loading control 

1 5 Figure 1 1 . Schematic diagram of allele-specific silencing of mutant 

TorsinA by small interfering RNA (siRNA). In the disease state, wild type and 
mutant alleles of TORI A are both transcribed into mRNA. siRNA Avith sequence 
identical to the mutant allele (deleted of GAG) should bind mutant mRNA 
selectively and mediate its degradation by the RNA-induced silencing complex 

20 (RISC) (circle). Wild type mRNA, not recognized by the mutant -specific 

siRNA, will remain and continue to be translated into normal TorsinA. The two 
adjacent GAG's in wild type TORI A alleles are shown as two parallelograms, 
one of which is deleted in mutant TORI A alleles. 

Figure 12, Design and targeted sequences of siRNAs. Shovm are the 

25 relative positions and targeted mRNA sequences for each primer used in this 
study. Mis-siRNA (negative control) does not target TA; com-siRNA targets a 
sequence present in wild type and mutant TA; wt-siRNA targets only wild type 
TA; and three mutant-specific siRNAs (Mut A, B, C). preferentially target 
mutant TA. The pair of GAG codons near the c-terminus of wild type mRNA are 

30 shown in underlined gray and black, with one codon deleted in mutant mRNA. 

Figure 13. siRNA silencing of TAwt and TAmut in Cos-7 cells. (A) 
Western blot results showing the effect of different siRNAs on GFP-TAwt 
expression levels. Robust suppression is achieved with wt-siRNA and com- 
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siKNA, while the mutant-specific siKNAs MutA, (B) and (C) have modest or no 
effect on GFP-TAwt expression. Tubulin loading controls are also shown. (B) 
Similar experiments with cells expressing HA-TAmut, showing significant 
suppression by mutant-specific siRNAs and com-siRNA but no suppression by 
5 the wild type-specific siRNA, wt-siRNA. (C) Quantification of results firom at 
least three separate experiments as in A and B. (D) Cos-7 cells transfected with 
GFP-TAwt or GFP-TAmut and different siRNAs visualized under fluorescence 
microscopy (200X). Representative fields are shown indicating allele-specific 
suppression. (E) Quantification of fluorescence signal &om two different 

1 0 experiments as in D. 

Figure 14. Allele-specific silencing by siRNA in the simulated 
heterozygous state. Cos-7 cells were cotransfected with plasmids encoding 
differentially tagged TAwt and TAmut, together with the indicated siRNA. (A) 
Western blot results analysis showing selective suppression of the targeted allele 

1 5 by Wt-siRNA or mutC-siRNA. (B) Quantification of results fi:om three 
experiments as in (A). 

Figure 15, Allele-specific silencing of mutant huntingtin by siRNA. 
PC6-3 cells were co-transfected with plasmids expressing siRNA specific for the 
polymorphism encoding the transcript for mutant huntingtin. 

20 

Detailed Description of the Invention 

Modulation of gene expression by endogenous, noncoding RNAs is 
increasingly appreciated as a mechanism playing a role in eukaryotic 
development, maintenance of chromatin stmcture and genomic integrity 

25 (McManus, 2002). Recently, techniques have been developed to trigger RNA 
interference (RNAi) against specific targets in mammalian cells by introducing 
exogenously produced or intracellularly expressed siRNAs (Elbashir, 2001; 
BruDimelkamp, 2002). These methods have proven to be quick, inexpensive and 
effective for knockdown experiments in vitro and in vivo (2 Elbashir, 2001; 

30 Brummelkamp, 2002; McCaffrey, 2002; Xia, 2002). The ability to accomplish 
selective gene silencing has led to the hypothesis that siRNAs might be 
employed to suppress gene expression for therapeutic benefit (Xia, 2002; Jacque, 
2002; Githn, 2002). 
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KNA interference is now established as an important biological strategy 
for gene silencing, but its ^plication to mammalian cells has been limited by 
nonspecific inhibitory effects of long double-stranded RNA on translation. 
Moreover, deUvery of interfering RNA has largely been limited to administration 
5 of RNA molecules. Hence, such administration must be performed repeatedly to 
have any sustained effect. The present inventors have developed a deUvery 
mechanism that results in specific sUencing of targeted genes through expression 
of smaU interfering RNA (siRNA). The inventors have markedly diminished 
sq)ression of exogenous and endogenous genes in vitro and in vivo in brain and 
10 liver, and finther apply this novel strategy to a model system of a major class of 
neurodegenerative disorders, the polyglutamine diseases, to show reduced 
polyglutamine aggregation in cells. This strategy is generaUy useful in reducing 
expression of target genes in order to model biological processes or to provide 
ther^y for dominant human diseases. 
1 5 Disclosed herein is a strategy that results in substantial silencing of 

targeted alleles via siRNA. Use of this strategy results in markedly diminished 
in vitro and in vivo expression of targeted alleles. This strategy is useful in 
reducing expression of targeted aUeles in order to model biological processes or 
to provide therapy for human diseases. For example, this strategy can be applied 
20 to a major class of neurodegenerative disorders, the polyglutamine diseases, as is 
demonstrated by the reduction of polyglutamine aggregation in cells foUowing 
application of the strategy. As used herein Ihe term "substantial silencing" 
means fliat the mRNA of the targeted aUele is inhibited and/or degraded by the 
presence of the introduced siRNA, such that expression of the targeted allele is 
25 reduced by about 10% to 100% as compared to the level of expression seen 
when the siRNA is not present. Generally, when an aUele is substantially 
silenced, it wiU have at least 40%, 50%, 60%, to 70%, e.g., 71%, 72%, 73%, 
74%, 75%, 76%, 77%, 78%, to 79%, generally at least 80%, e.g., 81%-84%, at 
least 85%, e.g., 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
30 97%, 98%, 99% or even 100% reduction expression as compared to when the 
siRNA is not present As used herein the term "substanitaly nomal activity" 
means the level of expression of an aUele when an siRNA has not been 
introduced to a cell. 



14 



wo 2004/013280 



PCTAJS2003/016887 



Dominantly inherited diseases are ideal candidates for siRNA-based 
ther^y. To explore the utility of siRNA in inherited human disoiders, the 
present inventors enoiployed cellular models to test whether mutant alleles 
responsible for these dominantly-inherited human disorders could be specifically 
5 targeted. First, three classes of donainantly inherited, untreatable 
neurodegenerative diseases were examined: polyglutamine (polyQ) 
neurodegeneration in MJD/SCA3, Huntington*s diseaseand Jfrontotempoial 
dementia with paridnsonism linked to chromosome 17 (FIDP-IT). Machado- 
Joseph disease is also know as Spinocerebellar Ataxia Type 3 (The HUGO 
10 official name is MID), The gene involved is MJDl, which encodes for the 
protein ataxin-3 (also called Mjdlp). Huntington's disease is due to expansion 
of tiie CAG repeat motif in exon 1 of huntingtin. In 38% of patients a 
polymorphism exists in exon 58 of the huntingtin gene, allowing for allele 
specific targeting. Frontotemporal dementia.(sometimes witii paikinonism, and 
15 linked to chromosome 17, so sometimes called FrDP-17) is due to mutations in 
the MAPTl gene that encodes the protein tau. 

The polyQ neurodegenerative disorders include at least nine diseases 
caused by CAG repeat expansions that encode polyQ in flie disease protein. 
PolyQ expansion confers a dominant toxic property on tiie mutant protein that is 
20 associated witii aberrant accumulation of the disease protein in neurons (Zoghbi, 
2000). Li FTDP-n, Tau mutations lead to the formation of neurofibrillary 
tangles accompanied by neuronal djrefimction and degeneration (Poorkaj, 1998; 
Hutton, 1998). The precise mechanisms by which fliese mutant proteins cause 
neuronal injury are unknown, but considerable evidence suggests tiiat the 
25 abnormal proteins fliemselves initiate the pathogenic process (Zoghbi, 2000). 
Accordmgly, eliminating expression of the mutant protein by siKNA or other 
means slows or prevents disease (Yamamoto, 2000). However, because many 
dominant disease genes also encode essential proteins (e.g. Nasir, 1995) siRNA- 
mediated approaches were developed tiiat selectively inactivate mutant alleles, 
30 while allowing continued expression of the wild type proteins ataxin-3 and 
huntingtin. 

Second, the dominantly-inherited disorder DYTl dystonia was studied. 
DYTl dystonia is also known as Torsion dystonia type 1, and is caused by a 
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GAG deletion in the TORI A gene encoding torsinA. DYTl dystonia is the most 
common cause of primary generalized dystonia. DYTl usually presents in 
childhood as focal dystonia that progresses to severe generalized disease (Fahn, 
1998; Klein, 2002a). With one possible exception (Leung, 2001; Doheny, 2002; 
5 Klein, 2002), all cases of DYTl result from a common GAG deletion in TORI A, 
eliminating one of two adjacent glutamic acids near the C-terminus of the 
protein TorsinA (TA) (Ozehus, 1997). Although the precise ceUular function of 
TA is unknown, it seans clear that mutant TA (TAmut) acts through a 
dominant-negative or dominant-toxic mechanism (Breakefield, 2001). 

10 Several characteristics of DYTl make it an ideal disease in which to use 

siRNA-mediated gene silencing as therapy. Of greatest importance, the dominant 
nature of the dispase suggests that a reduction m mutant TA, whatever the 
precise pathogenic mechanism proves to be, is helpful. Moreover, the existence 
of a smgle common mutation that deletes a full three nucleotides suggested it 

1 5 might be feasible to design siRNA that specifically targets the mutant allele and 
is applicable to all affected persons. Finally, there is no effective therapy for 
DYTl, a relentless and disabling disease. 

As outlined in the strategy in Figure 1 1 , the inventors developed siRNA 
that would specifically eluninate production of protein from the mutant allele. 
20 By exploiting the three base pair difference between wild type and mutant 
alleles, the inventors successfully silenced expression of the mutant protein 
(TAmut) without interfering with expression of the wild type protein (TAwt). 
Because TAwt may be an essential protein it is critically hnportant that efforts be 
made to silence only the mutant allele. This allele-specific strategy has obvious 
25 therapeutic potential for DYTl and represents a novel and powerful research tool 
with which to investigate the fimction of TA and its dysfunction in the disease 
state; 

Expansions of poly-glutamine tracts in proteins that are expressed in the 
central nervous system can cause neinodegenerative diseases. Some 
30 neurodegenerative diseases are caused by a (CAG)n repeat that encodes poly- 
glutamine in a protein include Huntington disease (HD), spinocerebellar ataxia 
(SCAl, SCA2, SCA3, SCA6, SCA7), spinal and bulbar muscular atrophy 
(SBMA), and dentatorubropallidoluysian atrophy (DRPLA). In these diseases. 
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the poly-glutamine expaiisioa in a protein confers a novel toxic property upon 
the protein. Studies indicate that the toxic property is a tendency for the disease 
protem to misfold and form aggregates within neurons. 

The gene involved in Huntington's disease (IT-15) is located at the end of 
5 the short arm of chromosome 4. This gene is designated HD and encodes the 
protein huntingtin (also known as Htt). A mutation occurs in the coding region 
of this gene and produces an unstable expanded trinucleotide repeat 
(cytosine-adenosine-guanosine), resulting in a protein with an expanded 
glutamate sequence. The normal and abnormal fonctions of this protein (termed 
10 huntingtm) are unknown. The abnormal huntingtin protein appears to 

accumulate in neuronal nuclei of transgenic mice, but the causal relationship of 
this accumiilation to neuronal death is uncertain. 

One of skill in the art can select additional target sites for generating 
siElNA specific for other alleles beyond those specifically described in the 
15 experimental examples. Such allele-specific siKNAs made be designed using 
the guidelines provided by Ambion (Austin, TX). Briefly, the target cDNA 
sequence is scanned for target sequences that had AA di-nucleotides. Sense and 
anti-sense oligonucleotides are generated to these targets (AA + 3' adjacent 19 
nucleotides) that contained a G/C content of 35 to 55%. These sequences are 
20 then compared to others in the human genome database to minimize homology 
to other known coding sequences (BLAST search). 

To accomplish intracellular expression of the therapeutic siRNA, an 
RNA molecule is constructed containing two complementary strands or a hairpin 
sequence (such as a 21-bp hairpin) representing sequences directed against the 
25 gene of interest. The siKNA, or a nucleic acid encoding the siKNA, is 

introduced to the target cell, such as a diseased brain cell. The siRNA reduces 
target mRNA and protein expression. 

The construct encoding the therapeutic siRNA is configured such that the 
the one or more strands of the siRNA are encoded by a nucleic acid that is 
30 immediately contiguous to a promoter. In one example, the promoter is a pol n 
promoter. If a pol II promoter is used in a particular constmct, it is selected firom 
readily available pol n promoters known in the art, depending on whether 
regulatable, inducible, tissue or cell-specific expression of the siRNA is desired. 
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The construct is introduced into the target cell, such as by injection, allowing for 
duninished target-gene expression in the cell. 

It was surprising that a pol n promoter would be effective. While small 
RNAs with extensive secondary structure are routinely made fiom Pol HI 
S promoters, th^ is no a priori reason to assume that small interfering RNAs 
could be expressed from pol n promoters. Pol m promoters terminate in a short 
stretch of Ts (5 or 6), leaving a very small 3* end and allowing stabilization of 
secondary structure. Polymerase II transcription extends well past the coding 
and polyadenylation regions, after which the transcript is cleaved Two 

10 adenylation steps occur, leaving a transcript with a tail of up to 200 As. This 
string of As would of course completely destabilize any small, 21 base pair 
hairpin. Therefore, in addition to modifying the promoter to minimize sequences 
between the transcription start site and the siRNA sequence (thereby stabilizing 
the hairpin), the inventors also extensively modified the polyadenylation 

1 5 sequence to test if a very short polyadenylation could occur. The results, which 
were not predicted fi-om prior literature, showed that it could. 

The present invention provides an expression cassette containing an 
isolated nucleic acid sequence encoding a small interfering RNA molecule 
(slRNA) targeted against a gene of interest The siKNA may form a hairpin 

20 structure that contains a duplex stmcture and a loop stmcture. The loop structure 
may contain from 4 to 10 nucleotides, such as 4, 5 or 6 nucleotides. The duplex 
is less than 30 nucleotides in length, such as from 19 to 25 nucleotides. The 
siRNA may fiulher contain an overhang region. Such an oveihang may be a 3' 
overhang region or a 5' overhang region. The overhang region may be, for 

25 example, from 1 to 6 nucleotides in length. The expression cassette may further 
contain a pol 11 promoter, as described herein. Examples of pol n promoters 
include regulatable promoters and constitutive promoters. For example, the 
promoter may be a CMV or RS V promoter. The expression cassette may ftirther 
contain a polyadenylation signal, such as a synthetic minimal polyadenylation 

30 signal. The nucleic acid sequence may further contain a marker gene. The 
expression cassette may be contained in a viral vector. An appropriate viral 
vector for use in the present invention may be an adenoviral, lentiviral, adeno- 
associated viral (AAV), poliovirus, herpes simplex virus (HS V) or murine 
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Maloney-based viral vector. The gene of interest may be a gene associated with 
a condition amenable to siRNA ther^y. Examples of such conditions include 
neurodegenerative diseases, such as a trinucleotide-repeat disease (e.g., 
polyglutamine repeat disease). Examples of these diseases include Huntington's 
5 disease or several spinocerebellar ataxias. Alternatively, the gene of interest 
may encode a ligand for a chemokine involved in the migration of a cancer cell, 
or a chemokine receptor. 

The present invention also provides an expression cassette containing an 
isolated nucleic acid sequence encoding a first segment, a second segment 

10 located inamediately 3* of the first segment, and a third segment located 

immediately 3' of the second segment, wherein the first and third segments are 
each less than 30 base pairs in length and each more than 10 base pairs in length, 
and wherein the sequence of the third segment is the complement of the 
sequence of the first segment, and wherein the isolated nucleic acid sequence 

15 fimctions as a small interfering RNA molecule (siRNA) targeted against a gene 
of interest The expression cassette may be contained in a vector, such as a viral 
vector. 

The present invention provides a method of reducing the expression of a 
gene product in a cell by contacting a cell with an expression cassette described 

20 above. It also provides a method of treating a patient by administering to the 
patient a composition of the expression cassette described above. 

The present invention finther provides a method of reducing the 
expression of a gene product in a cell by contacting a cell with an expression 
cassette containing an isolated nucleic acid sequence encoding a first segment, a 

25 second segment located immediately 3' of the first segment, and a third segment 
located immediately 3' of the second segment, wherein the first and third 
segments are each less than 30 base pairs in length and each more than 10 base 
pairs in length, and wherein the sequence of the third segment is the complement 
of the sequence of the first segment, and wherein the isolated nucleic acid 

30 sequence functions as a small interfering RNA molecule (siRNA) targeted 
against a gene of interest. 

The present method also provides a method of treating a patient, by 
administering to the patient a composition containing an expression cassette, 
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wherein the expression cassette contains an isolated nucleic acid sequence 
encoding a fiist segment, a second segment located immediately 3* of the JBrst 
segment, and a third segment located unmediately 3' of the second segment, 
wherein the first and ttiird segments are each less than 30 bases in length and 
S each more than 10 bases in length, and wherein the sequence of the third 

segment is the complement of the sequence of the first segment, and wherein the 
isolated nucleic acid sequence functions as a small interfering SNA molecule 
(siKNA) targeted against a gene of interest 

10 L Definitions 

The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides 
and polymers thereof in either single- or double-stranded form, composed of 
monomers (nucleotides) containing a sugar, phosphate and a base that is either a 
purine or pyrimidine. Unless specifically limited, the term encompasses nucleic 
1 5 acids containing known analogs of natural nucleotides that have similar binding 
properties as the reference nucleic acid and are metabolized in a manner similar 
to naturally occurring nucleotides. Unless otherwise indicated, a particular 
nucleic acid sequence also encompasses conservatively modified variants thereof 
(e.g., degenerate codon substitutions) and complementary sequences, as well as 
20 the sequence explicitly indicated. Specifically, degenerate codon substitutions 
may be achieved by generating sequences in which the third position of one or 
more selected (or all) codons is substituted with mixed-base and/or deoxyinosine 
residues (Batzer et al, (1991); Ohtsuka et aL, (1985); Rossolini et ai, (1994)). 
A "nucleic acid Segment" is a portion of a given nucleic acid molecule. 
25 Deoxyribonucleic acid (DNA) in the majority of organisms is the genetic 

material while ribonucleic acid (RNA) is involved in the transfer of information 
contained within DNA into proteins. 

. The term "nucleotide sequence" refers to a polymer of DNA or RNA 
which can be single- or double-stranded, optionally containing synthetic, non- 
30 natural or altered nucleotide bases capable of incorporation into DNA or RNA 
polymers. 

The terms "nucleic acid", ''nucleic acid molecule", "nucleic acid 
firagment", "nucleic acid sequence or segment", or ''polynucleotide" are used 
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interchangeably and may also be used interchangeably with gene, cDNA, DNA 
and RNA encoded by a gene. 

The invention encompasses isolated or substantially purified nucleic acid 
or protein compositions, hi the context of the present invention, an "isolated" or 
5 "purified" DNA molecule or RNA molecule or an "isolated" or "purified" 

polypeptide is a DNA molecule, UNA molecule, or polypeptide that exists apart 
from its native environment and is therefore not a product of nature. An isolated 
DNA molecule, SNA molecule or polypeptide may exist in a purified form or 
may exist in a non-native environm^t such as, for exanq)le, a transgenic host 

10 cell. For example, an "isolated" or "purified" nucleic acid molecule or protein, 
or biologically active portion thereof is substantially free of other cellular 
material, or culture medium when produced by recombinant techniques, or 
substantially free of chemical precursors or o&er chemicals when chemically 
synthesized. In one embodiment, an "isolated" nucleic acid is free of sequences 

IS that naturally flank the nucleic acid (i,e,, sequences located at the 5' and 3' ends 
of the nucleic acid) in the genomic DNA of the organism fi:om which the nucleic 
acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0. 1 
kb of nucleotide sequences that naturally flank the nucleic acid molecule in 

20 genomic DNA of the cell from which the nucleic acid is derived, A protein that 
is substantially free of cellular material includes preparations of protein or 
polypeptide having less than about 30%, 20%, 10%, or 5% (by dry weight) of 
contaminating protein. When the protein of the invention, or biologically active 
portion thereof, is recombinantly produced, preferably culture mediimi 

25 represents less than about 30%, 20%, 10%, or 5% (by dry weight) of chemical 
precursors or non-protein-of-interest chemicals. Fragments and variants of the 
disclosed nucleotide sequences and proteins or partial-length proteins encoded 
thereby are also encompassed by the present invention. By "fragment" or 
"portion" is meant a fiill length or less than fiill length of the nucleotide sequence 

30 encoding, or the amino acid sequence of, a polypeptide or protein. 

The term "gene" is used broadly to refer to any segment of nucleic acid 
associated with a biological function. Thus, genes include coding sequences 
and/or the regulatory sequences required for their expression. For example. 
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"gene" refers to a nucleic acid fiagment that expresses mRNA, functional RNA, 
or specific protein, including regulatory sequences. "Genes" also include 
nonexpressed DNA segments that, for example, form recognition sequences for 
other proteins. "Genes" can be obtained fiom a variety of sources, including 
5 cloning fiom a source of interest or synthesizing from known or predicted 
sequence information, and may include sequences designed to have desired 
parameters. An "allele" is one of several altemative fomis of a gene occupying a 
given locus on a chromosome. 

"Naturally occurring" is used to describe an object that can be found in 
10 nature as distinct from being artificially produced. For example, a protein or 
nucleotide sequence present in an organism (including a virus), which can be 
isolated from a source in nature and which has not been intentionally modified 
by a person in the laboratory, is naturally occurring. 

The term "chimeric" refers to a gene or DNA that contains 1) DNA 
1 5 sequences, including regulatory and coding sequences, that are not found 
together in nature, or 2) sequences encoding parts of proteins not naturally 
adjoined, or 3) parts of promoters that are not naturally adjoined. Accordingly, a 
chimeric gene may include regulatory sequences and coding sequences that are 
derived from different sources, or include regulatory sequences and coding 
20 sequences derived from the same source, but arranged in a manner different from 
that found in nature. . 

A "transgene" refers to a gene that has been introduced into the genome 
by transformation. Transgenes include, for example, DNA that is either 
heterologous or homologous to the DNA of a particular cell to be transformed. 
25 Additionally, transgenes may include native genes inserted into a non-native 
organism, or chimeric genes. 

The term "endogenous gene" refers to a native gene in its natural location 
in the genome of an organism. 

A "foreign" gene refers to a gene not normally found in the host 
30 organism that has been introduced by gene transfer. 

The terms "protein," "peptide" and "polypeptide" are used 
interchangeably herein. 
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A "variant" of a molecule is a sequence that is substantially similar to ±q 
sequence of the native molecule. For nucleotide sequences, variants include 
those sequences that, because of the degeneracy of the genetic code, encode the 
identical amino acid sequence of the native protein. Naturally occurring allelic 
5 variants such as these can be identified with liie use of molecular biology 
techniques, as, for example, with polymarase chain reaction (PGR) and 
hybridization techniques. Variant nucleotide sequences also include 
synthetically derived nucleotide sequences, such as fliose generated, for 
example, by using site-directed mutagenesis, which encode the native protein, as 
10 well as those that encode a polype^jtide having amino acid substitutions. 

Generally, nucleotide sequence variants of the invention will have at least 40%, 
50%, 60%, to 70%. e.g., 71%. 72%, 73%, 74%, 75%, 76%. 77%, 78%, to 79%, 
generally at least 80%, e.g., 81%-84%, at least 85%, e.g., 86%, 87%, 88%, 89%, 
90%, 91%, 92%. 93%. 94%. 95%, 96%, 97%, to 98%, sequence identity to the ' 
1 5 native (endogenous) nucleotide sequence. 

"ConsCTvatively modified variations" of a particular nucleic acid 
sequence refers to those nucleic acid sequences that encode identical or 
essentially identical amino acid sequences. Because of the degeneracy of the 
genetic code, a large number of fimctionally identical nucleic acids encode any 
20 given polypeptide. For instance, the codons CGT, CGC, CGA, CGG, AGA and 
AGG aU encode the amino acid arginine. Thus, at every position where an 
arginine is specified by a codon. the codon can be altered to any of the 
corresponding codons described without altering the encoded protein. Such 
nucleic acid variations are "silent variations." which are one species of 
15 "conservatively modified variations." Every nucleic acid sequence described 
herein that encodes a polypeptide also desaibes every possible silent variation, 
except where otherwise noted. One of skill in the art wiU recognize that each 
codon in a nucleic acid (except ATG, which is ordinarily the only codon for 
methionine) can be modified to yield a fimctionaUy identical molecule by 
0 standard techniques. Accordingly, each "silent variation" of a nucleic acid that 
encodes a polypeptide is implicit in each described sequence. 

"Recombinant DNA molecule" is a combination of DNA sequences that 
are joined together using recombinant DNA technology and procedures used to 
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join together DNA sequences as described, for exanq)le, in Sambrook and 
RusseU (2001). 

The terms 'lieterologous gene", •'heterologous DNA sequence", 
"exogenous DNA sequence", •'heterologous RNA sequence", "exogenous RNA 
5 sequaice" or •'hetCToIogous nucleic acid" each refer to a sequence fliat eitiiCT 
originates ftom a source foreign to the particular host ceU, or is from flie same 
source but is modified fiom its original or native form. Thus, a heterologous 
gene in a host cell includes a gene fliat is endogenous to flie particular host cell 
but has been modified through, for example, the use of DNA shuffling. The 

10 terms also include non-naturally occurring multiple copies of a naturally 
occurring DNA or RNA sequence. Thus, the terms refer to a DNA or RNA 
segment that is foreign or heterologous to the cell, or homologous to the ceU but 
in a position within flie host cell nucleic add in which the element is not 
ordinarily found. Exogenous DNA segments are e^ressed to yield exogenous 

15 polypeptides. 

A "homologous" DNA or RNA sequence is a sequence that is naturaUy 
associated wifli a host cell into which it is introduced. 

"Wild-type" refers to flie normal gene or organism found in nature. 
"Genome" refers to flie complete genetic material of an organism. 
20 A "vector" is defined to include, inter alia, any viral vector, as well as 

any plasmid, cosmid, phage or binary vector in double or single stranded linear 
or circular form tiiat may or may not be self transmissible or mobilizable, and 
fliat can transform prokaryotic or eukaryotic host eiflier by integration into the 
cellular gaiome or exist extrachromosomally (e.g., autonomous rephcating 
25 plasmid with an origin of replication). 

"Expression cassette" as used herein means a nucleic acid sequence 
capable of directing expression of a particular nucleotide sequence in an 
appropriate host cell, which may include a promoter operably linked to the 
nucleotide sequence of interest fliat may be operably linked to termination 
30 signals. It also may include sequences required for proper translation of the 
nucleotide sequence. The coding region usually codes for a protein of interest 
but may also code for a fimctional RNA of interest, for example an antisense 
RNA, a nontranslated RNA in tiie sense or antisense direction, or a siKNA. The 
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expression cassette including the nucleotide sequence of interest may be 
chimCTic. The expression cassette may also be one that is naturally occurring but 
has been obtained in a recombinant fomi useful for heterologous e;q)ression. 
The ejqjression of the nucleotide sequence in the expression cassette may be 
5 under the control of a constitutive promoter or of an regulatable promoter that 
initiates transcription only ^en the host cell is exposed to some particular 
stimulus. In the case of a multicellular organism, the promoter can also be 
specific to a particular tissue or organ or stage of development. 

Such expression cassettes can include a transcriptional initiation region 
10 linked to a nucleotide sequence of interest. Such an egression cassette is 

provided witii a pluraUly of restriction sites for insertion of the gene of interest to 
be under the transcriptional regulation of the regulatory regions. The expression 
cassette may additionally contain selectable marker genes. 

"Coding sequence" refers to a DNA or RNA sequence tiiat codes for a 
15 specific amino acid sequence. It may constitute an "uninterrupted coding 

sequence", /.e., lacking an intron, such as in a cDNA, or it may include one or 
' more introns bounded by appropriate splice junctions. An "intron" is a sequence 
of RNA fliat is contained in the primary transcript but is removed tiirough 
cleavage and re-Ugation of the RNA within the cell to create tiie mature mRNA 
20 that can be translated into a protein. 

The term "open reading fi^ie" (ORF) refers to tiie sequence between 
translation initiation and termination codons of a coding sequence. The terms 
"initiation codon" and "termination codon" refer to a unit of three adjacent 
nucleotides (a 'codon') in a coding sequence that specifies initiation and chain 
25 termination, respectively, of protein synthesis (mRNA translation). 

"Functional RNA" refers to sense RNA, antisense RNA, ribozyme RNA, 
siRNA, or other RNA that may not be translated but yet has an effect on at least 
one cellular process. 

The term "RNA transcript" refers to the product resulting from RNA 
30 polymerase catalyzed transcription of a DNA sequence. When the RNA 

transcript is a perfect complementary copy of the DNA sequence, it is referred to 
as the primary transcript or it may be a RNA sequence derived fi-om 
posttranscriptional processing of flie primary transcript and is referred to as the 
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mature RNA. "Messenger RNA" (mRNA) refers to the RNA that is without 
introns and that can be translated into protein by the cell "cDNA" refers to a 
single- or a double-stranded DNA that is complementary to and derived &om 
mKNA. 

5 "Regulatory sequences" and "suitable regulatory sequences" each refer to 

nucleotide sequences located upstream (5' non-coding sequences), within, or 
downstream (3' non-coding sequences) of a coding sequence, and which 
influence the transcription. RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences include enhancers, 
10 promoters, translation leader sequences, introns, and polyadenylation signal 
sequences. They include natural and synthetic sequences as well as sequences 
that may be a combination of synthetic and natural sequences. As is noted 
above, the term "suitable regulatory sequences" is not limited to promoters. 
However, some suitable regulatory sequences useful in the present invention wiU 
15 include, but are not limited to constitutive promoters, tissue-specific promoters, 
developmait-spedfic promoters, regulatable promoters and viral promoters. 
Examples of promoters that maybe used in the present invention include CMV, 
RSV, polU and poM promoters. 

"5' non-coding sequence" refers to a nucleotide sequence located 5' 
20 (upstream) to the coding sequence. It is present in the tully processed mRNA 
upstream of the initiation codon and may affect processing of the primary 
transcript to mRNA, mRNA stabiUty or translation efficiency (Turner et al., 
1995). 

"3' non-coding sequence" refers to nucleotide sequences located 3' 
25 (downstream) to a coding sequence and may include polyadenylation signal 
sequences and other sequences encoding regulatory signals capable of affecting 
mRNA processing or gene expression. The polyadenylation signal is usually 
chaiactaized by affecting the addition of polyadenylic acid tracts to the 3' end of 
Qie mRNA precursor. 
30 The term "translation leader sequence" refers to that DNA sequence 

portion of a gene between the promoter and coding sequence that is transcribed 
into RNA and is present m the fully processed mRNA upstream (5") of the 
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translation start codon. The translation leader sequence may affect processing of 
the primary transcript to mRNA, mRNA stabiHty or translation efficiency. 

The term "mature" protein refers to a post-translationally processed 
polypeptide without its signal peptide. "Precursor" protein refers to the primary 
5 product of translation of an mRNA. "Signal peptide" refers to the amino 

terminal extension of a polypeptide, which is translated in conjunction witii the 
polypeptide forming a precursor peptide and which is required for its entrance 
into tiie secretory pathway. The term "signal sequence" refers to a nucleotide 
sequence that encodes the signal peptide. 
10 "Promoter" refers to a nucleotide sequence, usually upstream (S") to its 

coding sequence, v/idch directs and/or controls the expression of the coding 
sequence by providing the recognition for RNA polymerase and other factors 
required for proper transcription. "Promoter" includes a minimal promoter that is 
a short DNA sequence comprised of a TATA- box and other sequences that 
15 serve to specify the site of transcription initiation, to which regulatory elements 
are added for control of expression. "Promoter" also refers to a nucleotide 
sequence that includes a minimal promoter plus regulatory elements that is 
capable of controlling the expression of a coding sequence or functional RNA. 
This type of promoter sequence consists of proximal and more distal upstream 
20 elements, the latter elements often referred to as enhancers. Accordingly, an 
"enhancer" is a DNA sequence tiiat can stimulate promoter activity and may be 
. an innate element of the promoter or a heterologous element inserted to enhance 
the level or tissue specificity of a promoter. It is capable of operating in botii 
orientations (normal or flipped), and is capable of functioning even when moved 
25 either upstream or downstream &om the promoter. Both enhancers and other 
upstream promoter elements bind sequence-specific DNA-binding proteins that 
mediate their effects. Promoters may be derived in tiieu- entirety from a native 
gene, or be composed of diflferent elements derived torn different promoters 
found in nature, or even be comprised of synthetic DNA segments. A promoter 
30 may also contam DNA sequences that are involved m tiie bmding of protein 
factors that control the eJBfectiveness of transcription initiation in response to 
physiological or developmental conditions. 
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The "initiation site" is the position surrounding the first nucleotide that is 
part of the transcribed sequence, which is also defined as position +1. With 
respect to this site all other sequences of the gene and its controlling regions are 
numbered. Downstream sequences (z.e., further protein encoding sequences in 
5 the 3* direction) are denominated positive, while upstream sequences (mostly of 
the controlling regions in the 5' direction) are denominated negative. 

Promoter elements, particularly a TATA element, that are inactive or that 
have greatly reduced promoter activity in the absence of upstream activation are 
referred to as "minimal or core promoters." In the presence of a suitable 

10 transcription factor, the niinimal promoter flmctions to permit transcription. A 
"minimal or core promoter" thus consists only of all basal elements needed for 
transcription initiation, e.g., a TATA box and/or an initiator. 

"Constitutive expression" refers to expression usmg a constitutive or 
regulated promoter. "Conditional" and "regulated expression" refer to 

1 5 expression controlled by a regulated promoter. 

"Operably-linked" refers to the association of nucleic acid sequences on 
single nucleic acid fi:^gment so that the function of one of the sequences is 
affected by another. For example, a regulatory DNA sequence is said to be 
"operably linked to" or "associated with" a DNA sequence that codes for an 

20 RNA or a polypeptide if the two sequences are situated such that the regulatory 
DNA sequence affects expression of the coding DNA sequence (i.e., that the 
coding sequence or functional RNA is under the transcriptional control of the 
promoter). Coding sequences can be operably-linked to regulatory sequences in 
sense or antisense orientation. 

25 "Expression" refers to the transcription and/or translation of an 

endogenous gene, heterologous gene or nucleic acid segment, or a transgene in 
cells. For example, in the case of siRNA constructs, expression may refer to the 
transcription of the siRNA only. In addition, expression refers to the 
transcription and stable accumulation of sense (mRNA) or functional RNA. 

30 Expression may also refer to the production of protein. 

"Altered levels" refers to the level of expression in transgenic cells or 
organisms that differs firom that of normal or untransformed cells or organisms. 
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"Overexpression" refers to the level of expression in transgenic cells or 
organisms that exceeds levels of expression in normal or untransfonned cells or 
organisms. 

"Antisense inhibition" refers to the production of antisense RNA 
transcripts capable of suppressing the expression of protein from an endogenous 
gene or a transgene. 

"lYanscription stop fragment" refers to nucleotide sequences that contain 
one or more regulatory signals, such as polyadenylation signal sequences, 
capable of temiinating transcription. Examples include the 3' non-regulatory 
regions of genes encoding nopaline synthase and the smaU subunit of ribulose 
bisphosphate carboxylase. 

"Translation stop fragment" refers to nucleotide sequences that contain 
one or more regulatory signals, such as one or more tennination codons in all 
three frames, capable of terminating translation. Insertion of a translation stop 
15 fragment adjacent to or near the initiation codon at the 5' end of the coding 
sequence wiU result in no translation or improper translation. Excision of the 
translation stop fragment by site-specific recombination wiU leave a site-specific 
sequeuce in the coding sequence that does not interfere with proper translation 
using tiie initiation codon. 

The tarns "ciy-acting sequence" and "c«-acting element" refer to DNA 
or RNA sequences whose functions require them to be on the same molecule. 
An example of a cis-acting sequence on the repUcon is the viral repUcation 
origin. 

The terms "fra/ur-acting sequence" and "trans-acting element" refer to 
25 DNA or KNA sequences whose function does not require them to be on the same 
molecule. 

"Chromosomally-integrated" refers to the integration of a foreign gene or 
nucleic acid construct into the host DNA by covalent bonds. Where genes are 
not "chromosomally mtegrated" they may be "transiently expressed." Transient 
30 expression of agene refers to the expression of a gene that is not integrated into 
the host chromosome but functions independently, either as part of an 
autonomously replicating plasmid or expression cassette, for example, or as part 
of anollier biological system such as a virus. 



20 



29 



wo 2004/013280 



PCT/US2003/016887 



The following terms are used to describe the sequence relationships 
between two or more nucleic acids or polynucleotides: (a) "reference sequence", 
(b) "comparison window", (c) "sequence identity", (d) "percentage of sequence 
identity", and (e) "substantial identity". 
5 (a) As used herein, "reference sequence" is a defined sequence used as a 

basis for sequence comparison. A reference sequence may be a subset or the 
entirety of a specified sequence; for example, as a segment of a full-length 
cDNA or ger^e sequence, or the complete cDNA or gene sequence. 

(b) As used herein, "comparison window" makes reference to a 
1 0 contiguous and specified segment of a polynucleotide sequence, wherein the 
polynucleotide sequence in the comparison window may comprise additions or 
deletions (z.e., gaps) compared to the reference sequence (which does not 
comprise additions or deletions) for optimal ahgnment of the two sequences. 
Generally, the comparison window is at least 20 contiguous nucleotides in 
1 5 length, and optionally can be 30, 40, 50, 1 00, or longer. Those of skill in the art 
understand that to avoid a high similarity to a reference sequence due to 
inclusion of gaps in the polynucleotide sequence a gap penalty is typically 
introduced and is subtracted firom the number of matches. 

Methods of alignment of sequences for comparison are well-known in 
20 the art. Thus, the determination of percent identity between any two sequences 
can be accomplished using a mathematical algorithm. Preferred, non-hmiting 
examples of such mathematical algorithms are the algorithm of Myers and Miller 
(1988); the local homology algorithm of Smith et al (1981); the homology 
alignment algorithm of Needleman and Wunsch (1970); the search-for- 
25 similarity-method of Pearson and Lipman (1988); the algorithm of KarUn and 
Altschul (1990), modified as in Karlin and Altschul (1993). 

Computer implementations of these mathematical algorithms can be 
utilized for comparison of sequences to detemiine sequence identity. Such 
implementations include, but are not limited to: CLUSTAL in the PC/Gene 
30 program (available fi-om IntelUgenetics, Mountain View, California); the ALIGN 
program (Version 2.0) and GAP, BESTFIT, BLAST, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, Version 8 (available fi-om Genetics 
Computer Group (GCG), 575 Science Drive, Madison, Wisconsin, USA). 
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Alignments using these programs can be performed using the default parameters. 
The CLUSTAL program is well described by Higgins et al (1988); Higgins et 
al (1989); Corpet et al (1988); Huang et al (1992); and Pearson et al (1994). 
The ALIGN program is based on the algorithm of Myers and Miller, supra. The 
5 BLAST programs of Altschvd et al (1990), are based on the algorithm of Karlin 
and Altschul supra. 

Software for performing BLAST analyses is publicly available through 
the National Center for Biotechnology Information 

(http://www.ncbi.nlm.nikgov/). This algorithm involves first identifying high 

10 scoring sequence pairs (HSPs) by identifying short words of length W in the 
query sequence, which either match or satisfy some positive- valued threshold 
score T when aligned with a word of the same length in a database sequence. T 
is referred to as the neighborhood word score threshold. These initial 
neighborhood word hits act as seeds for initiating searches to fmd longer HSPs 

1 5 containing them. The word hits are then extended in both directions along each 
sequence for as far as the cumulative aligmnent score can be increased. 
Cumulative scores are calculated using, for nucleotide sequences, the parameters 
M (reward score for a pair of matching residues; always > 0) and N (penalty 
score for mismatching residues; always < 0). For amino acid sequences, a 

20 scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when the cumulative ahgmnent score falls off by 
the quantity X firom its maximum achieved value, the cumulative score goes to 
zero or below due to the accumulation of one or more negative-scoring residue 
alignments, or the end of either sequence is reached. 

25 In addition to calculating percent sequence identity, the BLAST 

algorithm also performs a statistical analysis of the similarity between two 
sequences. One measure of similarity provided by the BLAST algorithm is the 
smallest sum probability (P(N))> which provides an mdication of the probability 
by which a match between two nucleotide or amino acid sequences would occur 

30 by chance. For example, a test nucleic acid sequence is considered similar to a 
reference sequence if the smallest sum probabiUty in a comparison of the test 
nucleic acid sequence to the reference nucleic acid sequence is less than about 



31 



wo 2004/013280 



PCT/US2003/016887 



0,1, more preferably less than about 0.01, and most preferably less than about 
0,001. 

To obtain gapped alignments for comparison purposes, Gapped BLAST 
(in BLAST 2.0) can be utilized as described in Altschul et al (1997). 
5 Alternatively, PSI-BLAST (in BLAST 2.0) can be used to perform an iterated 
search that detects distant relationships between molecules. See Altschul et al. , 
supra. When utilizing BLAST, Gapped BLAST, PSI-BLAST, the default 
parameters of the respective programs {e.g. BLASTN for nucleotide sequences, 
BLA.STX for proteins) can be used. The BLASTN program (for nucleotide 

10 sequences) uses as defeults a wordlength (W) of 1 1, an expectation (E) of 10, a 
cutoffof 100, M=5, N=-4, and a comparison of both strands. For amino acid 
sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an 
expectation (E) of 10, and the BLOSUM62 scoring matrix. See 
http://www,ncbi,nlm.nih.gov. AUgmnent may also be performed manually by 

IS inspection. 

For purposes of the present invention, comparison of nucleotide 
sequences for determination of percent sequence identity to the promoter 
sequences disclosed herein is preferably made using the BlastN program 
(version 1.4.7 or later) with its defiiult parameters or any equivalent program. 
20 By "equivalent program" is intended any sequence comparison program that, for 
any two sequences in question, generates an alignment having identical 
nucleotide or amino acid residue matches and an identical percent sequence 
identity when compared to the corresponding aligmnent generated by the 
preferred program. 

25 (c) As used herein, "sequence identity' or "identity" in the context of two 

nucleic acid or polypeptide sequences makes reference to a specified percentage 
of residues in the two sequences that are the same when aligned for maximum 
correspondence over a specified comparison window, as measured by sequence 
comparison algorithms or by visual mspection. When percentage of sequence 

30 identity is used in reference to proteins it is recognized that residue positions 
which, are not identical often differ by conservative amino acid substitutions, 
where amino acid residues are substituted for other amino acid residues with 
similar chemical properties (e.^,, charge or hydrophobicity) and therefore do not 
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change the functional properties of tiie molecule. When sequences differ in 
conservative substitutions, the percent sequence identity may be adjusted 
upwards to correct for the conservative nature of the substitution. Sequences 
tiiat differ by such conservative substitutions are said to have "sequence 
5 similarity" or "similarity." Means for making this adjustment are well knovra to 
those of skill in the art. Typically this involves scoring a conservative 
substitution as a partial rather than a full mismatch, thereby increasing the 
percentage sequence identity. Thus, for example, where an identical amino acid 
is given a score of 1 and a non-conservative substitution is given a score of zero, 
10 a conservative substitution is given a score between zero and 1 . The scoring of 
conservative substitutions is calculated, e.g., as implemented in the program 
PC/GENE (Intelligenetics, Mountain View, California). 

(d) As used herein, "percentage of sequence identity" means the value 
determined by comparing two optimally aUgned sequences over a comparison 

15 window, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (z.e., gaps) as compared to the 
reference sequence (which does not comprise additions or deletions) for optimal 
aligmnent of the two sequences. The percentage is calculated by determining the 
number of positions at which the identical nucleic acid base or amino acid 

20 residue occurs in both sequences to yield the number of matched positions, 

dividing the number of matched positions by the total number of positions in the 
window of comparison, and multiplying the result by 100 to yield the percentage 
of sequence identity. 

(e) (i) The term "substantial identity" of polynucleotide sequences means 
25 that a polynucleotide comprises a sequence that has at least 70%, 71%, 72%, 

73%, 74%, 75%, 76%, 77%, 78%, or 79%, preferably at least 80%, 81%, 82%, 
83%, 84%, 85%, 86%, 87%, 88%, or 89%, more preferably at least 90%, 91%, 
92%, 93%, or 94%, and most preferably at least 95%, 96%, 97%, 98%, or 99% 
sequence identity, compared to a reference sequence using one of the alignment 
30 programs described using standard parameters. One of skill in the art will 
recognize that these values can be appropriately adjusted to determine 
corresponding identity of proteins encoded by two nucleotide sequences by 
taking into account codon degeneracy, amino acid similarity, reading frame 
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positioning, and tbe like. Substantial idesntity of amino acid sequences for these 
purposes normaUy means sequence identity of at least 70%, more preferably at 
least 80%, 90%, and most preferably at least 95%. 

Anotiier indication fliat nucleotide sequences are substantiaUy identical is 
5 if two molecules hybridize to each oflier under stringent conditions. Generally, 
stiingent conditions are selected to be about 5°C lower than tiie themal melting 
pomt (Tm) for the specific sequence at a defined ionic stiength and pH. 
However, stiingent conditions encompass ten^jeratures in the range of about 1 °C 
to about 20''C, depending upon the desired degree of sfaingency as otherwise 
10 qualified herein. Nucleic acids tiiat do not hybridize to each other under stringent 
conditions are stiU substantially identical if the polypeptides diey encode are 
substantiaUy identical This may occur, e.g. , when a copy of a nucleic acid is 
created using the maximum codon degeneracy permitted by the genetic code. 
One mdication that two nucleic acid sequences are substantially identical is 
when tile polypeptide encoded by the first nucleic acid is immunologically cross 
reactive wifli flie polypeptide oicoded by flie second nucleic acid. 

(e)(ii) The term "substantial identity" in the context of a peptide indicates 
fliat a pq)tide comprises a sequence with at least 70%, 71%, 72%, 73%, 74%, 
75%, 76%, 77%, 78%, or 79%, preferably 80%, 81%, 82%, 83%, 84%, 85%, 
20 86%, 87%, 88%, or 89%, more preferably at least 90%, 91%, 92%, 93%, or 
94%, or even more preferably, 95%, 96%, 97%, 98% or 99%, sequence identity 
to flie reference sequence over a specified comparison window. Preferably, 
optimal aUgnment is conducted using the homology ahgnment algorithm of 
Needleman and Wunsch (1970). An mdication that two peptide sequences are 
25 substantially identical is tiiat one peptide is inununologicaUy reactive with 
antibodies raised against the second peptide. Thus, a peptide is substantially 
identical to a second peptide, for example, where tiie two peptides differ only by 
a conservative substitution. 

For sequence comparison, typically one sequence acts as a referaice 
30 sequraice to which test sequences are compared. When using a sequence 

comparison algoritiun, test and reference sequences are input into a computer, 
subsequaice coordmates are designated if necessary, and sequence algorithm 
program parameters are designated. The sequence comparison algorithm then 
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calculates the percent sequence identity for the test sequence(s) relative to the 
reference sequence, based on the designated program parameters. 

As noted above, another indication that two nucleic acid sequences are 
substantiaUy identical is that the two molecules hybridize to each other under 
5 stringent conditions. The phrase "hybridizing specifically to" refers to the 
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide 
sequence under stringent conditions v/hea that sequence is present in a complex 
mixture (e.g., total cellular) DNA or RNA. **Bind(s) substantially" refers to 
complementary hybridization between a probe nucleic acid and a target nucleic 
10 acid and embraces minor mismatches that can be accommodated by reducing the 
stringency of the hybridization media to achieve the desired detection of the 
target nucleic acid sequence. 

"Stringent hybridization conditions" and "stringent hybridization wash 
conditions" in the context of nucleic acid hybridization experiments such as 
15 Southern and Northern hybridizations are sequence dependent, and are different 
under different environmental parameters. Longer sequences hybridize 
specifically at higher temperatures. The Tm is the temperature (under defined 
ionic strength and pH) at which 50% of the .target sequence hybridizes to a 
perfectly matched probe. Specificity is typically the fimction of post- 
20 hybridization washes, the critical factors being the ionic strength and 

temperature of the final wash solution. For DNA-DNA hybrids, the Tm can be 
approximated from the equation of Meinkoth and Wahl (1984); Tm 81,5°C + 
16.6 (log M) +0.41 (%GC) - 0.61 (% form) - 500/L; where M is the molarity of 
monovalent cations, %GC is the percentage of guanosine and cytosine 
25 nucleotides in the DNA, % form is the percentage of formamide in the 

hybridization solution, and L is the length of the hybrid in base pairs. Tm is 
reduced by about 1°C for each 1% of mismatching; thus, Tm, hybridization, 
and/or wash conditions can be adjusted to hybridize to sequences of the desired 
identity. For example, if sequences with >90% identity are sought, the Tm can 
30 be decreased lO^'C. Generally, stringent conditions are selected to be about 5°C 
lower than the thermal melting point (Tm) for the specific sequence and its 
complement at a defined ionic strength and pH. However, severely stringent 
conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4°C lower than 
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the ttiennal melting point (Jndl moderately stringent conditions can utilize a 
hybridization and/or wash at 6, 7, 8, 9, or 10«C lower than the thermal melting 
point (Tm); low stringency conditions can utilize a hybridization and/or wash at 
11, 12, 13, 14, 15, or 20°C lower than the thermal melting point (T„0- Using the 
5 equation, hybridization and wash compositions, and desked T, those of ordinary 
skill will understand that variations in the stringency of hybridization and/or 
wash solutions are inherently described. Ifthe desired degree of mismatching 
results in a T of less than 45°C (aqueous solution) or 32°C (formamide 
solution), it is preferred to increase the SSC concentration so that a higher 
10 temperature can be used. An extensive guide to the hybridization of nucleic 
acids is found in Tijssen (1993). GeneraUy, highly stringent hybridization and 
wash conditions are selected to be about 5°C lower than the thermal melting 
point (TnO for the specific sequence at a defined ionic strength and pH. 

An example of highly stringent wash conditions is 0.15 M NaCl at 72^C 
15 for about 15 minutes. An example of stringent wash conditions is a 0.2X SSC 
wash at 65°C for 15 minutes (see, Sambrook and Russell, infra, for a description 
of SSC buffer). Often, a high stringency wash is preceded by a low stringency 
wash to remove background probe signal. An example medium stringency wash 
for a duplex o^ e.g., more than 100 nucleotides, is IX SSC at 45°C for 15 
20 minutes. An example low stringency wash for a duplex of, e.g„ more than 1 00 
nucleotides, is 4-6X SSC at 40°C for 15 minutes. For short probes (e.g., about 
10 to 50 nucleotides), stringent conditions typically involve salt concentrations 
of less than about 1.5 M, more preferably about 0.01 to 1.0 M, Na ion 
concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at 
25 least about 30°C and at least about 60T for long probes (e.g., >50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. In general, a signal to noise ratio of 2X (or higher) 
than that observed for an unrelated probe in the particular hybridization assay 
mdicates detection of a specific hybridization. Nucleic acids that do not 
30 hybridize to each other under stringent conditions are still substantially identical 
ifthe proteins that they encode are substantially identical. This occurs, e.g. , 
when a copy of a nucleic acid is created using the maximum codon degeneracy 
permitted by the genetic code. 
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Very stringent conditions are selected to be equal to the for a 
particular probe. An example of stringent conditions for hybridization of 
complementary nucleic acids which have more than 100 complementary residues 
on a filter in a Southern or Northern blot is 50% fonnamide, e.g,, hybridization 
5 in 50% fonnamide, 1 M NaCl, 1% SDS at ST'C, and a wash m O.IX SSC at 60 
to 65°C. Exemplary low stringency conditions include hybridization with a 
buflfer solution of 30 to 35% formamide, IM NaCl, 1% SDS (sodium dodecyl 
sulfate) at SV^'C, and a wash in IX to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M 
trisodium citrate) at 50 to SS^'C. Bcenq)lary moderate stringency conditions 

10 include hybridization in 40 to 45% fonnamide, 1.0 M NaCl, 1% SDS at 37°C, 
and a wash in 0,5X to IX SSC at 55 to 60°a 

By "variant" polypeptide is intended a polypeptide derived from the 
native protein by deletion (also called '^truncation") or addition of one or more 
amino acids to the N-terminal and/or C-tenninal end of the native protein; 

15 deletion or addition of one or more amino acids at one or more sites in the native 
protein; or substitution of one or more amino acids at one or more sites in the 
native protein. Such variants may results from, for example, genetic 
polymorphism or from human manipulation. Methods for such manipulations 
are generally known in the art. 

20 Thus, the polypeptides of the invention may be altered in various ways 

including amino acid substitutions, deletions, truncations, and insertions. 
Methods for such manipulations are generally known in the art. For example, 
amino acid sequence variants of the polypeptides can be prepared by mutations 
in the DNA. Methods for mutagenesis and nucleotide sequence alterations are 

25 well known m the art. See, for example, Kunkel (1985); Kunkel et al (1987); U. 
S. Patent No. 4,873,192; Walker and Gaastra (1983), and the references cited 
therein. Guidance as to appropriate amino acid substitutions that do not affect 
biological activity of the protein of interest may be found in the model of 
Dayhoff aL (1978). Conservative substitutions, such as exchanging one 

30 amino acid with another having sunilar properties, are preferred. 

Thus, the genes and nucleotide sequences of the invention include both 
the natiually occurring sequences as well as variant forms. Likewise, the 
polypeptides of the invention encompass both naturally occurring proteins as 
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well as variations and modified forms thereof. Such variants will continue to 
possess the desired activity. The deletions, insertions, and substitutions of the 
polypeptide sequence encompassed herein are not ejcpected to produce radical 
changes in the characteristics of the polypeptide. However, when it is difficult to 
5 predict the exact effect of the substitution, deletion, or insertion in advance of 
doing so, one skilled in the art will appreciate that the effect will be evaluated by 
routine screening assays. 

Individual substitutions deletions or additions that alter, add or delete a 
single amino acid or a small percentage of amino acids (typically less than 5%, 
10 more typically less than 1%) in an encoded sequence are "conservatively 

modified variations," where the alterations result in the substitution of an amino 
acid with a chemically similar amino acid. Conservative substitution tables 
providing fimctionally similar amino acids are well known in the art. The 
following five groups each contain amino acids that are conservative 
15 substitutions for one another: Aliphatic: Glycine (G), Alanine (A), Valine (V), 
Leucine (L), Isoleucine (I); Aromatic: Phenylalanine (F), Tyrosine (Y), 
Tryptophan (W); Sulfur-containing: Methionine (M), Cysteine (C); Basic: 
Arginine (R), Lysine (K), Histidine (H); Acidic: Aspartic acid (D), Glutamic 
acid (E), Asparagine (N), Glutamine (Q). In addition, individual substitutions, 
20 deletions or additions which alter, add or delete a single amino acid or a small 
percentage of amino acids in an encoded sequence are also "conservatively 
modified variations." 

The term "transformation" refers to the transfer of a nucleic acid 
fragment into the genome of a host cell, resulting in genetically stable 
25 inheritance. A *liost cell" is a cell that has been transformed, or is capable of 
transformation, by an exogenous nucleic acid molecule. Host cells containing the 
transfomied nucleic acid fiiagments are referred to as "transgenic" cells, and 
organisms comprising transgenic cells are referred to as "transgenic organisms". 
^Transformed", '^transduced", "transgenic", and 'Recombinant" refer to a 
30 host cell or organism into which a heterologous nucleic acid molecule has been 
introduced. The nucleic acid molecule can be stably integrated into the genome 
generally known in the art and are disclosed in Sambrook and Russell, infra. 
See also Innis et al (1995); and Gelfand (1995); and hmis and Gelfand (1999). 
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Known methods of PGR include, but are not limited to, methods using paired 
primers, nested primers, single specific primers, degenerate primers, gene- 
specific primers, vector-specific primers, partially mismatched primers, and the 
like. For example, "transformed," "transformant," and "transgenic" cells have 
5 been through the transformation process and contain a foreign gene integrated 
into their chromosome. The term "untransformed" refers to normal ceUs that 
have not been througji the transformation process. 

A "transgenic" organism is an organism having one or more cells that 
contain an expression vector. 
10 "Genetically altered cells'* denotes ceUs which have been modified by the 

introduction of recombinant or heterologous nucleic acids (e,g,, one or more 
DNA constructs or their RNA counterparts) and further includes the progeny of 
such cells which retain part or all of such genetic modification. 

The term "fiision protein" is intended to describe at least two 
15 polypeptides, typically from different sources^ which are operably linked. With 
regard to polypeptides, the term operably linked is intended to mean that the two 
polypeptides are connected in a manner such that each polypeptide can serve its 
intended fimction. Typically, tiie two polypeptides are covalentiy attached 
fluough peptide bonds. The fiision protein is preferably produced by standard 
20 recombinant DNA techniques. For example, a DNA molecule encoding the first 
polypeptide is ligated to another DNA molecule encoding the second 
polypeptide, and the resultant hybrid DNA molecule is expressed in a host cell to 
produce tiie fusion protein. The DNA molecules are hgated to each other in a 5' 
to 3' orientation such that, after ligation, the translational frame of tiie encoded 
25 polypeptides is not altered the DNA molecules are ligated to each other in- 
frame). 

As used herein, the term "derived" or "directed to" witii respect to a 
nucleotide molecule means that the molecule has complementary sequence 
identity to a particular molecule of interest. 
30 "Gene silencing" refers to the suppression of gene expression, e.g., 

transgene, heterologous gene and/or endogenous gene expression. Gene 
silencing may be mediated through processes that affect transcription and^or 
through processes that affect post-transcriptional mechanisms. In some 
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embodiments, gene silencing occurs when siKNA initiates the degradation of the 
mRNA of a gene of interest in a sequence-specific manner via RNA iaterference 
(for a review, see Brantl, 2002). In some embodiments, gene silencing may be 
allele-specific. "Allele-specific" gene silencing refers to the specific silencing of 
S one allele of a gene. 

"Knock-down," "knock-down technology^ refers to a technique of gene 
silencing in which the expression of a target gene is reduced as compared to the 
gene expression prior to the introduction of the siRNA, which can lead to the 
inhibition of production of the target gene product. The term ^'reduced" is \xsed 
10 hereitt to indicate that the target gene expression is lowered by 1-100%, For 

example, the expression may be reduced by 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 
or even 99%. Knock-down of gene expression can be directed by the use of 
dsRNAs or siRNAs. For example, '"KNA interference (RNAi)," which can 
involve the use of siRNA, has been successfully appUed to knockdown the 
15 expression of specific genes in plants, D. melanogaster, C. elegatis, 

trypanosomes, planaria, hydra, and several vertebrate species including the 
mouse. For a review of the mechanisms proposed to mediate RNAi, please refer 
to Bass et al, 2001, Elbashir et al, 2001 or Brantl 2002. 

"RNA interference (RNAi)" is the process of sequence-specific, post- 
20 transcriptional gene silencing mitiated by siRNA. RNAi is seen in a number of 
organisms such as Drosophila, nematodes, fungi and plants, and is believed to 
be involved in anti-viral defense, modulation of transposon activity, and 
regulation of gene expression. During RNAi, siRNA induces degradation of 
target mRNA with consequent sequence-specific inhibition of gene expression. 
25 A "small intCTfering*' or "short interfering RNA" or siRNA is a RNA 

• duplex of nucleotides that is targeted to a gene interest, A "RNA duplex" refers 
to the structure formed by the complementary pairing between two regions of a 
RNA molecxile, siRNA is "targeted" to a gene in that the nucleotide sequence of 
the duplex portion of the siRNA is complementary to a nucleotide sequence of 
30 the targeted gene. In some embodiments, the length of the duplex of siRNAs is 
less than 30 nucleotides. In some embodiments, the duplex can be 29, 28, 27, 
26, 25, 24, 23, 22, 21, 20, 19, 18, 17, 16, 15, 14. 13, 12, 1 1 or 10 nucleotides in 
length. In some embodiments, the length of the duplex is 19 - 25 nucleotides in 
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length. The RNA duplex portion of the siRNA can be part of a haiipin structure. 
In addition to the diq)lex portion, the haiipin structure may contain a loop 
portion positioned between the two sequences that fonn the duplex. The loop 
can vary in length. In some embodiments the loop is 5, 6, 7, 8, 9, 10, 11, 12 or 
5 13 nucleotides in lengfli. The haiipin structure can also contain 3' or 5' overhang 
portions. In some embodiments, the overhang is a 3* or a 5' overhang 0, 1, 2, 3, 
4 or 5 nucleotides in length. 

The siRNA can be encoded by a nucleic add sequence, and the nucleic 
acid sequence can also include a promoter. The nucleic acid sequence can also 
10 include a polyadenylation signal. In some embodiments, the polyadenylation 
signal is a synthetic minimal polyadenylation signal. 

^Treating*' as used herein refers to ameliorating at least one symptom of, 
curing and/or preventing the development of a disease or a condition. 

*'Neurological disease" and **neurological disorder" refer to both 
1 5 hereditary and sporadic conditions that are characterized by nervous s)^tem 

dysfunction, and which may be associated with atrophy of the adBfected central or 
peripheral nervous system structures, or loss of function without atrophy. A 
neurological disease or disorder that results in atrophy is commonly called a 
'^neurodegenerative disease" or **neurodegenerative disorder." 
20 Neurodegenerative diseases and disorders include, but are not limited to, 
amyotrophic lateral sclerosis (ALS), hereditary spastic hemiplegia, primary 
lateral sclerosis, spinal muscular atrophy, Kennedy's disease, Alzheimer's 
disease, Parkinson's disease, multiple sclerosis, and repeat expansion 
neurodegenerative diseases, e,g,y diseases associated with expansions of 
trinucleotide repeats such as polyglutamine (polyQ) repeat diseases, 
Huntington's disease (HD), spinocerebellar ataxia (SCAl, SCA2, SCA3, SCA6, 
SCA7, and SCAl 7), spinal and bulbar muscular atrophy (SBMA), 
dentatorubropallidoluysian atrophy (DRPLA). An example of a neurological 
disorder that does not appear to result in atrophy is DYTl dystonia. 

n. Nucleic Acid Molecules of the Invention 
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Sources of nucleotide sequences fijom which the present nucleic acid 
molecules can be obtained inchide any vertebrate, preferably manmalian, 
cellular source. 

As discussed above, the terms "isolated and/or purified" refer to in vitro 
5 isolation of a nucleic add, e.g.. a DNA or RNA molecule fixnn its natural 
cellular environment, and fiom association witti other conqwnents of die cell, 
such as nucleic acid or polypeptide, so that it can be sequenced, repUcated, 
and/or expressed. For example, "isolated nucleic acid" may be a DNA molecule 
containing less than 31 sequential nucleotides that is transcribed into an siRNA. 
10 Such an isolated siRNA may, for exanq)le, form a hairpin structure with a 

duplex 21 base pairs in length that is complemaitary or hybridizes to a sequence 
in a gene of interest, and remains stably bound under stringent conditions (as 
defined by methods well known in <he art, e.g., in Sambrook and RusseU, 2001). 
Thus, the RNA or DNA is "isolated" in Aat it is fi^ ftom at least one 
15 contaminating nucleic acid with which it is normally associated in the natural 
source of the RNA or DNA and is preferably substantially fi-ee of any other 
mammalian RNA or DNA. The phrase "fiee fiom at least one contaminating 
source nucleic add with which it is normaUy associated" includes the case where 
the nucldc acid is reintroduced into the source or natural ceU but is in a different 
20 chromosomal location or is otherwise flanked by nucldc acid sequences not 
normally found in the source cell, e.g., in a vector or plasmid. 

In addition to a DNA sequence encoding a siRNA, the nucldc acid 
molecules of the invention include double-stranded interfering RNA molecules, 
\^ch are also useful to inhibit expression of a target gene. 
25 As used herein, the term '•recombinant nucldc add", e.g. , "recombinant 

DNA sequence or segment" refers to a nucleic add, e.g., to DNA, that has been 
derived or isolated fiom any appropriate cellular source, that may be 
subsequently chemically altered in vitro, so that its sequence is not naturally 
occurrinfe or corresponds to naturally occurring sequences that are not 
30 positioned as they would be positioned in a genome which has not been 

transformed with exogenous DNA. An example of preselected DNA "derived" 
fi^m a source, would be a DNA sequence that is identified as a useful fragment 
wittiin a given organism, and which is flien chemically synthesized in essentially 
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30 



pure fonn. An example of such DNA "isolated" fiom a source would be a useful 
DNA sequence that is excised or removed fiom said source by chemical means, 
e.g.. by the use of restriction endonucleases, so that it can be fiirttier 
manq)ulated. e.g., amplified, for use in the invention, by the methodology of 
genetic engineering. 

Thus, recovery or isolation of a given fiagment of DNA from a 
restriction digest can employ separation of the digest on polyacrylamide or 
agarose gel by electrophoresis, identification of the fiagment of interest by 
comparison of its mobiUty versus that of marker DNA fragments of known 
molecular weight, removal of the gel section containing the desired fragment, 
andseparationofthegelfiomDNA. SeeUwne/a/. (1981), and Goeddel eM/. 
(1980). Therefore, "recombinant DNA" includes completely synthetic DNA 
sequences, semi-synthetic DNA sequences, DNA sequences isolated from 
biological sources, and DNA sequences derived from RNA, as weU as mixtores 
15 thereof. 

Nucleic add molecules having base substitutions (I.C., variants) are 
prepared by a variety of methods known in the art. These methods include, but 
are not lunited to, isolation fiom a natural source (in the case of naturaUy 
occurring sequence variants) or preparation by oUgonucleotide-mediated (or 
site-directed) mutagenesis, PGR mutagenesis, and cassette mutagenesis of an 
earlier prq)ared variant or a non-variant version of the nucleic acid molecule. 

OKgonucleotide-mediated mutagenesis is a method for preparing 
substitution variants. This technique is known in the art as described by 
Adehnan et al (1983). Briefly, nucleic acid encoding a slRNA can be altered by 
hybridizing an oUgonucleotide encoding flie desired mutation to a DNA 
template, where the template is the single-stranded fomi of a plasmid or 
bacteriophage containing the unaltered or native gene sequence. After 
hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the 
oligonucleotide primer, and wiU code for the selected alteration in the nucleic 
acid encoding siRNA. Generally, oUgonucIeotides of at least 25 nucleotides in 
length are used. An optimal oUgonucleotide will have 12 to 15 nucleotides that 
are completely complementary to the template on either side of the nucleotide(s) 
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coding for the mutation. This ensures that the oKgonucleotide will hybridize 
properly to the single-stranded DNA template molecule. The oligonucleotides 
are readily synthesized using techniques known in the art such as that described 
byCreae^a/.(1978). 
5 The DNA template can be generated by those vectors that are either 

derived fix)m bacteriophage M13 vectors (the commercially ayailable M13mpl8 
and M13n[q)19 vectors are suitable), or fliose vectors that contain a 
single-stranded phage origin of replication as described by Viera et al (1987). 
Thus, the DNA that is to be mutated may be inserted iato one of these vectors to 
10 generate single-stranded template. Production of the single-stranded template is 
■ described in Chapter 3 of Sambrook and Russell, 2001 . Alternatively, 
single-stranded DNA template may be generated by denaturing double-stranded 
plasmid (or other) DNA usmg standard techniques. 

For alteration of the native DNA sequence (to generate amino acid 
1 5 sequence variants, for example), the oligonucleotide is hybridized to the 
smgle-stranded template under suitable hybridization conditions. A DNA 
polymerizing enzyme, usually the Klenow fragment of DNA polymerase I, is 
then added to synthesize the complementary strand of the template vising the 
oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus 
20 formed such that one strand of DNA encodes the mutated form of the DNA, and 
the other strand (the original template) encodes the native, unaltered sequence of 
the DNA. This heteroduplex molecule is then transformed into a suitable host 
cell, usually a prokaryote such as E, coli JMIOI. After the cells are grown, they 
are plated onto agarose plates and screened usiag the oligonucleotide primer 
25 radiolabeled with 32-phosphate to identify the bacterial colonies that contain the 
mutated DNA. The mutated region is then removed and placed in an appropriate 
vector, generally an expression vector of the type typically employed for 
transformation of an appropriate host. 

The method described irmnediately above may be modified such that a 
30 homoduplex molecule is created wherein both strands of the plasmid contain the 
mutations(s). The modifications are as follows: The single-stranded 
oligonucleotide is aimealed to the single-stranded template as described above. 
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 
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deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine caUed dCTP-(*S) (which can be obtained from 
the Amersham Corporation). This mixture is added to the 
template-oligonucleotide complex. Upon addition of DNA polymerase to this 
5 mixture, a strand of DNA identical to the template except for the mutated bases 
is generated. In addition, this new strand of DNA will contain dCTP-(*S) 
instead of dCTP, which serves to protect it fiom restriction endonuclease 
digestion. 

After the template strand of the double-stranded heteroduplex is nicked 
10 with an appropriate restriction enzyme, the template strand can be digested with 
Exom nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
15 deoxyribonucleotide triphosphates, ATP, and DNA Hgase. This homoduplex 
molecule can then be transfonned into a suitable host cell such as E. coli JMIOI. 

BB. Expression Cassettes of the Invention 

To prepare e3q)ression cassettes, the recombinant DNA sequence or 

20 segment may be circular or linear, double-stranded or single-stranded. 

Generally, the DNA sequence or segment is in the form of chimeric DNA, such 
as plasmid DNA or a vector that can also contain coding regions flanked by 
control sequences that promote the expression of the recombinant DNA present 
in the resultant transformed cell. 

25 A "chimeric" vector or expression cassette, as used herein, means a 

vector or cassette including nucleic acid sequences from at least two different 
species, or has a nucleic acid sequence from the same species that is linked or 
associated in a manner that does not occur in the **native" or wild type of the 
species. 

30 Aside from recombinant DNA sequences that serve as transcription units 

for an RNA transcript , or portions thereof, a portion of the recombinant DNA 
may be untranscribed, serving a regulatory or a structural function. For example, 
the recombinant DNA may have a promoter that is active in mammalian cells. 
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Other elraients functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the recombinant 
DNA. Such elements may or may not be necessary for the function of the DNA, 
but may provide improved e3q)ression of the DNA by affecting transcription, 
5 stability of the siRNA, or the like. Such elements may be included in the DNA 
as desired to obtain the optunal performance of the siKNA in the cell. 

Control sequences are DNA sequences necessary for the expression of an 
operably linked coding sequence in a particular host organism. The control 
sequences that are suitable for prokaryotic cells, for example, include a 
10 promoter, and optionally an operator sequence, and a ribosome binding site. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and 
enhancers. 

Operably linked nucleic acids are nucleic acids placed in a fimctional 
relationship with another nucleic acid sequence. For example, a promoter or 

15 enhancer is operably linked to a coding sequence if it affects the transcription of 
the sequence; or a ribosome binding site is operably linked to a coding sequence 
if it is positioned so as to faciUtate translatioa Generally, operably linked DNA 
sequences are DNA sequences that are linked are contiguous. However, 
enhancers do not have to be contiguous. Linking is accomplished by ligation at 

20 convenient restriction sites. Ifsuchsitesdonot exist, the synthetic 

oligonucleotide adaptors or linkers are used in accord with conventional practice. 

The recombinant DNA to be introduced into the cells may contain either 
a selectable marker geae or a reporter gene or both to facilitate identification and 
selection of expressing cells firom the population of cells sought to be transfected 

25 or infected through viral vectors. In other embodiments, the selectable marker 
may be carried on a separate piece of DNA and used in a co-transfection 
procedure. Both selectable markers and reporter genes may be flanked with 
appropriate regulatory sequences to enable expression in the host cells. Useful 
selectable markers are known in the art and include, for example, antibiotic- 

30 resistance genes, such as neo and the like. 

Reporter genes are used for identifying potentially transfected cells and 
for evaluating the functionality of regulatory sequences. Reporter genes that 
encode for easily assayable proteins are well known in the art. In general, a 
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reporter gene is a gene that is not present in or expressed by the recipient 
organism or tissue and that encodes a protein whose expression is manifested by 
some easily detectable property, e,g,y enzymatic activity. For example, reporter 
genes include the chloramphenicol acetyl transferase gene (cat) jfrom Tn9 of ^. 
5 coli and the luciferase gene fiom Stefly Photinus pyralis. Expression of the 
reporter gene is assayed at a suitable time after the DNA has been introduced 
into the recipient cells. 

The general methods for constructing recombinant DNA that can 
transfect target cells are well known to those skilled in the art, and the same 

10 compositions and methods of constmction may be utilized to produce the DNA 
useful herein. For example, Sambrook and Russell, infra, provides suitable 
methods of construction. 

The recombinant DNA can be readily introduced into the host cells, e.g., 
manmialian, bacterial, yeast or insect cells by transfection with an expression 

1 5 vector composed of DNA encoding the siRNA by any procedure useful for the 
introduction into a particular cell, physical or biological methods, to yield a 
cell having the recombinant DNA stably integrated into its genome or existing as 
a episomal element, so that the DNA molecules, or sequences of the present 
invention are expressed by the host cell. Preferably, the DNA is introduced into 

20 host cells via a vector. The host cell is preferably of eukaryotic origin, , 
plant, mammalian, insect, yeast or fungal sources, but host cells of non- 
eukaryotic origin may also be employed. 

Physical methods to introduce a preselected DNA iato a host cell include 
calcium phosphate precipitation, hpofection, particle bombardment, 

25 microinjection, electroporation, and the like. Biological methods to introduce 
the DNA of interest into a host cell include the use of DNA and RNA viral 
vectors. For mammaUan gene therapy, as described hereinbelow, it is desirable 
to use an efficient means of inserting a copy gene into the host genome. Viral 
vectors, and especially retroviral vectors, have become the most widely used 

30 method for inserting genes into manmiaUan, e.g., human cells. Other viral 

vectors can be derived from poxviruses, herpes simplex vims I, adenoviruses and 
adeno-associated viruses, and the like. See, for example, U.S. Patent Nos. 
5,350,674 and 5,585,362. 
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As discussed above, a **transfected", "or **traiisduced" host cell or cell 
line is one in which the gaiome has been altered or augmented by the presence 
of at least one heterologous or recombinant nucleic acid sequence. The host 
cells of the present invention are typically produced by transfection with a DNA 
5 sequence in a plasmid expression vector, a viral expression vector, or as an 
isolated linear DNA sequence. The transfected DNA can become a 
chromosomally integrated recombinant DNA sequence, which is composed of 
sequence encoding the siRNA 

To conJBrm the presence of the recombinant DNA sequence in the host 

1 0 cell, a variety of assays may be performed. Such assays include, for example, 
"molecular biological" assays well known to those of skill in the art, such as 
Southern and Northem blotting, RT-PCR and PGR; "biochemical" assays, such 
as detecting the presence or absence of a particular peptide, e,g., by 
immunological means (ELISAs and Western blots) or by assays described herein 

15 to identify agents falling within the scope of the invention. 

To detect and quantitate RNA produced from introduced recombinant 
DNA segments, RT-PCR may be employed. In this application of PGR, it is 
first necessary to reverse transcribe RNA into DNA, using enzymes such as 
reverse transcriptase, and then through the use of conventional PGR techniques 

20 amplify the DNA. In most instances PGR techniques, while useful, will not 

demonstrate integrity of the RNA product. Further infoimation about the nature 
of the RNA product may be obtained by Northem blotting. This technique 
demonstrates the presence of an RNA species and gives information about the 
integrity of that RNA. The presence or absence of an RNA species can also be 

25 determined using dot or slot blot Northem hybridizations. These techniques are 
modifications of Northem blotting and only demonstrate the presence or absence 
of an RNA species. 

While Southern blotting and PGR may be used to detect the recombinant 
DNA segment in question, they do not provide information as to whether the 

30 preselected DNA segment is being expressed. Expression may be evaluated by 
speciJBcally identifying the peptide products of the introduced recombinant DNA 
sequences or evaluating the phenotypic changes brought about by the expression 
of the introduced recombinant DNA segment in the host cell. 
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The instant invention provides a cell expression system for expressing 
exogenous nucleic acid material in a mammalian recipient The expression 
system, also referred to as a "genetically modified cell", comprises a cell and an 
expression vector for expressing the exogenous nucleic acid material. The 
5 genetically modified cells are suitable for administration to a mammaUan 
recipient, where they replace the endogenous cells of flie recipient Thus, tiie 
preferred genetically modified cells are non-immortalized and are non- 
tumorigenic. 

According to one embodiment, the cells are transfected or otherwise 
10 genetically modified ex vivo. The cells are isolated from a mammal (preferably 
a human), nucleic acid introduced transduced or transfected in vitro) with a 
vector for expressing a heterologous recombinant) gene encoding the 
therapeutic agent, and then administered to a mammalian recipient for delivery 
of the therapeutic agent in situ. The mammalian recipient may be a human and 
15 the cells to be modified are autologous cells, /.a, the cells are isolated from the 
mammahan recipient. 

According to another embodiment, the cells are transfected or transduced 
or otherwise genetically modified iTivfvo. The cells from the mammaUan 
recipient are transduced or transfected in vivo with a vector containing 
20 exogenous nucleic acid material for expressing a heterologous (e,g, , 

recombinant) gene encoding a therapeutic agent and the therapeutic agent is 
deUvered in situ. 

As used herein, "exogenous nxicleic acid material" refers to a nucleic acid 
or an oligonucleotide, either natural or synthetic, which is not naturally found in 

25 the cells; or if it is naturally found in the cells, is modified from its original or 
native form. Thus, "exogenous nucleic acid material" includes, for example, a 
non-naturally occurring nucleic acid that can be transcribed into an anti-sense 
RNA, a siKNA, as well as a 'lieterologous gene" a gene encoding a protein 
that is not expressed or is expressed at biologically insignificant levels in a 

30 naturally-occurring cell of the same type). To illustrate, a synthetic or natural 
gene encoding human erythropoietin (EPO) would be considered "exogenous 
nucleic acid material" with respect to human peritoneal mesothelial cells since 
the latter cells do not naturally express EPO. Still another example of 
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"exogenous nucleic acid material*' is the introduction of only part of a gene to 
create a recombinant gene, such as combining an regulatable promoter with an 
endogenous codmg sequence via homologous recombination. 

5 IV. Promoters of the Invention 

As described herein, an expression cassette of the invention contains, inter alia^ 
a promoter. Such promoters include the CMV promoter, as well as the RS V 
promoter, S V40 late promoter and retroviral LTRs (long terminal repeat 
elements), or brain cell specific promoters, although many other pix^moter 

10 elements well known to the art, such as tissue specific promoters or regulatable 
promoters may be employed in the practice of the invention. 

In one embodiment of the present invention, an expression cassette may 
contain a pel n promoter that is operably linked to a nucleic acid sequence 
encoding a siRNA. Thus, the pol 11 promoto:, i. e., a RNA polymerase n 

1 5 dependent promoter, initiates the transcription of the siRNA. In another 
embodiment, &e pol n promoter is regulatable. 

Three RNA polymerases transcribe nuclear genes in eukaryotes. RNA 
polymerase n (pol U) synthesizes mRNA, Le,, pol n transcribes the genes that 
encode protems. In contrast, RNA polymerase I (pol I) and RNA polymerase 

20 in ^ol IQ) transcribe only a limited set of transcripts, synthesizing RNAs that 
have structural or catalytic roles. RNA polymerase I makes the large ribosomal 
. RNAs (rRNA), which are under the control ofpol I promoters. RNA 
polymerase HI makes a variety of small, stable RNAs, including the small 5S 
rRNA and transfer RNAs (tRNA), the transcription of which is under the control 

25 of pol ni promoters. 

As described herein, the inventors unexpectedly discovered that pol n 
promoters are useful to direct transcription of the siRNA. This was surprising 
because, as discussed above, pol U promoters are thought to be responsible for 
transcription of messenger RNA, z.e., relatively long RNAs as compared to 

30 RNAs of 30 bases or less. 

A pol n promoter may be used in its entirety, or a portion or firagment of the 
promoter sequence may be used in which the portion maintains the promoter 
activity. As discussed herein, pol II promoters are known to a skilled person in 
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the art and include the promoter of any protein-encoding gene, e.g., an 
endogenously regulated gene or a constitutively expressed gene. For example, 
the promoters of genes regulated by cellular physiological events, e.g,, heat 
shock, oxygen levels and/or carbon monoxide levels, e.g., in hypoxia, may be 
5 used in the expression cassettes of the invention. In addition, the promoter of 
any gene regulated by the presence of a pharmacological agent, e.g., tetracycline 
and derivatives thereof, as well as heavy metal ions and hormones may be 
employed in the expression cassettes of the invention. In an embodiment of the 
invention, the pol n promoter can be the CMV promoter or the RSV promoter. 

10 In another embodiment, the pol n promoter is the CMV promoter. 

As discussed above, a pol n promoter of the invention may be one 
naturally associated with an endogenously regulated gene or sequence, as may 
be obtained by isolating the 5' non-coding sequences located upstream of the 
coding segment and'or axon. The pol 11 promoter of the expression cassette can 

15 be, for example, the same pol n promoter driving expression of the targeted gene 
of interest. Alternatively, the nucleic acid sequence encoding the siRNA may be 
placed under the control of a recombinant or heterologous pol II promoter, which 
refers to a promoter that is not normally associated with the targeted gene's 
natural environment. Such promoters include promoters isolated from any 

20 eukaryotic cell, and promoters not **naturally occurring," te., containing 
different elements of different transcriptional regulatory regions, and/or 
mutations that alter expression. In addition to producing nucleic acid sequences 
of promoters synthetically, sequences maybe produced using recombinant 
cloning and/or nucleic acid amplification technology, including PGR™, in 

25 connection with the compositions disclosed herein (see U.S. Patent 4,683,202, 
U.S. Patent 5,928,906, each incorporated herein by reference). 

In one embodiment, a pol n promoter that effectively directs the 
expression of the siRNA in the cell type, organelle, and organism chosen for 
expression will be employed. Those of ordinary skill in the art of molecular 

30 biology generally know the use of promoters for protein expression, for example, 
see Sambrook and Russell (2001), incorporated herein by reference. The 
promoters employed may be constitutive, tissue-specific, inducible, and/or useful 
under the appropriate conditions to direct high level expression of the introduced 
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DNA segment, such as is advantageous in the large-scale production of 
recomhinant proteins and/or peptides. The identity of tissue-specific promoters, 
as well as assays to characterize their activity, is well known to those of ordinary 
skill in the art. 

5 

V. Methods fo r Introdacing the Expression Cassettes of the 

Invention into Cells 
The condition amenable to gene inhibition iherzpy may be a prophylactic 
process, z.e., a process for preventing disease or an undesired medical condition. 
10 Thus, the instant invention embraces a system for delivering siKNA that has a 
prophylactic function a prophylactic agent) to the mammalian recipient. 

The inhibitory nucleic acid material an expression cassette 
encoding siRNA directed to a gene of interest) can be introduced into the cell ex 
vivo or in vivo by genetic transfer methods, such as transfection or transduction, 
15 to provide a genetically modified cell Various expression vectors (/. e. , vehicles 
for fadUtating deUvery of exogenous nucleic acid into a target cell) are known to 
one of ordinary skill in the art. 

As used herein, "transfection of cells" refers to the acquisition by a cell 
of new nucleic acid material by incorporation of added DNA. Thus, transfection 
20 refers to the insertion of nucleic acid into a cell using physical or chemical 

methods. Several transfection techniques are known to those of ordinary skill in 
the art including: calcium phosphate DNA co-precipitation (Methods in 
Molecular Biology (1991)); DEAE-dextran (supra); electroporation (supra); 
cationic Uposome-mediated transfection (supra); and tungsten particIe-faciUtated 
25 microparticle bombardment (Johnston (1990)). Strontium phosphate DNA co- 
precipitation (Brash et al (1987)) is also a transfection method. 

In contrast, "transduction of cells" refers to the process of transferring 
nucleic acid into a cell using a DNA or RNA virus. A RNA virus (z. e. , a 
retrovirus) for transferring a nucleic acid into a cell is referred to herein as a 
30 transducing chimeric retrovirus. Exogenous nucleic acid material contained 
within the retrovirus is incorporated into the genome of the transduced cell. A 
cell that has been transduced with a chimeric DNA virus (e.g., an adenovmis 
carrying a cDNA encoding a therapeutic agent), will not have the exogenous 
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nucleic acid material incorporated into its genome but will be capable of 
expressing the exogenous nucleic acid material that is retained 
extrachiomosomally within the cell. 

The exogenous nucleic add material can include the nucleic acid 
S encoding the siRNA together with a promoter to control transcription. The 
promoter characteristically has a specific nucleotide sequence necessary to 
initiate transcription. The exogenous nucleic acid material may further include 
additional sequences (Le,, enhancers) required to obtain the desired gene 
transcription activity. For the purpose of this discussion an "enhancer" is simply 

1 0 any non-translated DNA sequence that worics with the coding sequence (in cis) 
to change the basal transcription level dictated by the promoter. The exogenous 
nucleic acid material may be introduced into the cell genome immediately 
downstream from the promoter so that the promoter and coding sequence are 
opCTatively linked so as to permit transcription of the coding sequence. An 

1 5 expression vector can include an exogenous promoter element to control 

transcription of the inserted exogenous gene. Such exogenous promoters include 
both constitutive and regulatable promoters. 

Naturally-occurring constitutive promoters control the expression of 
essential cell functions. As a result, a nucleic acid sequence under the control of 

20 a constitutive promoter is expressed under all conditions of cell growth. 

Constitutive promoters include the promoters for the following genes which 
encode certain constitutive or "housekeeping" functions: hypoxanthine 
phosphoribosyl transferase (HPRT), dihydrofolate reductase (DHFR) 
(Scharfinann et al (1991)), adenosine deammase, phosphoglycerol kmase 

25 (PGK), pyruvate kinase, phosphoglycerol mutase, the beta-actin promoter (Lai et 
al. (1989)), and other constitutive promoters known to those of skill in the art. 
In addition, many viral promoters function constitutively in eukaryotic cells. 
These include: the early and late promoters of SV40; the long tenninal repeats 
(LTRs) of Moloney Leukemia Virus and other retroviruses; and the thymidine 

3Q kinase promoter of Herpes Simplex Virus, among many others. 

Nucleic acid sequences that are under the control of regulatable 
promoters are expressed only or to a greater or lesser degree in the presence of 
an inducing or repressing agent, (e.g., transcription under control of the 
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metallothionein promoter is greatly increased in presence of certain metal ions), 
Regulatable promoters include responsive elements (REs) that stimulate 
transcription when their inducing factors are bound. For example, there are REs 
for serum factors, steroid hormones, retinoic acid, cyclic AMP, and tetracycline 
5 and doxycycline. Promoters containing a particular RE can be chosen in order to 
obtain an regulatable response and in some cases, the RE itself may be attached 
to a different promoter, thereby conferring regulatability to the encoded nucleic 
acid sequence. Thus, by selecting the appropriate promoter (constitutive versus 
regulatable; strong versus weak), it is possible to control both the existence and 

1 0 level of expression of a nucleic acid sequence ia the genetically modified cell. If 
the nucleic acid sequence is under the control of an regulatable promoter, 
delivery of the therapeutic agent in situ is triggered by exposing the genetically 
modified cell in situ to conditions for permitting transcription of the nucleic acid 
sequence, e.^., by intraperitoneal injection of specific inducers of the regulatable 

15 promoters which control transcription of the agent. For example, in situ 

expression of a nucleic acid sequence under the control of the metallothioneiu 
promoter in genetically modified cells is enhanced by contacting the genetically 
modified cells with a solution containing the appropriate (j.e., inducing) metal 
ions in situ. 

20 Accordingly, the amount of siEUNfA generated in situ is regulated by 

controlling such factors as the nature of the promoter used to direct transcription 
of the nucleic acid sequence, (i.e,, whether the promoter is constitutive or 
regulatable, strong or weak) and the number of copies of the exogenous nucleic 
acid sequence encoding a siRNA sequence that are in the cell. 

25 In addition to at least one promoter and at least one heterologous nucleic 

acid sequence encoding the siRNA, the expression vector may include a 
selection gene, for example, a neomycin resistance gene, for facilitating selection 
of cells that have been transfected or transduced with the expression vector. 

Cells can also be transfected with two or more expression vectors, at least 

30 one vector containing the nucleic acid sequence(s) encoding the siRNA(s), the 
other vector containing a selection gene. The selection of a suitable promoter, 
enhancer, selection gene and/or signal sequence is deemed to be within the scope 
of one of ordinary skill in the art without imdue experimentation. 
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The following discussion is directed to various utilities of the instant 
invention. For example, the instant invention has utility as an expression system 
suitable for silencing the expression of gene(s) of interest 

The instant invention also provides various methods for making and 
5 using the above-described genetically-modified cells. 

The instant invention also provides methods for genetically modifying 
cells of a mammalian recipient in vivo. According to one embodiment, the 
method comprises introducing an expression vector for expressiag a siRNA 
sequence in cells of the mammalian recipient in situ by, for example, mjecting 
10 the vector into the recipient. 

VI. Delivery Vehicles for the Expression Cassettes of the 
Invention 

Delivery of compounds into tissues and across the blood-brain barrier 

15 can be limited by the size and biochemical properties of the compounds. 

Currently, eflBcient delivery of compounds into cells in vivo can be achieved 
only when the molecules are small (usually less than 600 Daltons). Gene 
transfer for the correction of inborn errors of metabolism and neurodegenerative 
diseases of the central nervous system (CNS), and for the treatment of cancer has 

20 been accomplished with recombinant adenoviral vectors. 

The selection and optimization of a particular expression vector for 
expressing a specific siRNA in a cell can be accomplished by obtaining the 
nucleic acid sequence of the siRNA, possibly with one or more appropriate 
control regions promoter, insertion sequence); preparing a vector construct 

25 comprising the vector into which is inserted the nucleic acid sequence encoding 
the siRNA; transfecting or transducing cultured cells in vitro with the vector 
construct; and determining whether the siRNA is present in the cultured cells. 

Vectors for cell gene therapy include viruses, such as replication- 
deficient viruses (described in detail below). Exemplary viral vectors are 

30 derived firom Harvey Sarcoma virus, ROUS Sarcoma virus, (MPSV), Moloney 
murine leukemia virus and DNA viruses (e.g., adenovirus) (Temin (1986)), 

Replication-deficient retroviruses are capable of directing synthesis of all 
virion proteins, but are incapable of making infectious particles. Accordingly, 
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these genetically altered retroviral expression vectors have general utility for 
high-efiBciency transduction of nucleic acid sequences in cultured cells, and 
specific utility for use in the method of the present invention. Such retroviruses 
further have utility for the efBcient transduction of nucleic acid sequences into 
5 cells in vivo. Retroviruses have been used extensively for transferring nucleic 
acid material into cells. Standard protocols for producing replication-deficient 
retroviruses (including the steps of incorporation of exogenous nucleic acid 
material into a plasmid, transfection of a packaging cell line with plasmid, 
production of recombinant retroviruses by the packaging cell line, collection of 

10 viral particles from tissue culture media, and infection of the target cells with the 
viral particles) are provided in Kriegler (1990) and Murray (1991). 

An advantage of using retroviruses for gene therapy is that the viruses 
insert the nucleic acid sequence encoding the siRNA into the host cell genome, 
thereby permitting the nucleic acid sequence encoding the siRNA to be passed 

15 on to the progeny of the cell when it divides. Promoter sequences in the LTR 
region have been reported to enhance expression of an inserted coding sequence 
in a variety of cell types (see e.g., Hilberg et al (1987); Holland et al (1987); 
Valerio et al (1989). Some disadvantages of using a retrovirus expression 
vector are (1) insertional mutagenesis, i.e., the insertion of the nucleic acid 

20 sequence encoding the siRNA into an undesirable position in the target cell 
genome which, for example, leads to unregulated cell growth and (2) the need 
for target cell proliferation in order for the nucleic acid sequence encoding the 
siRNA carried by the vector to be integrated into the target genome (Miller et al 
(1990)). 

25 Another viral candidate usefiil as an expression vector for transformation 

of cells is the adenovirus, a double-stranded DNA virus. The adenovirus is 
infective in a wide range of cell types, including, for example, muscle and 
endothelial cells (Larrick and Burck (1991)). The adenovirus also has been used 
as an expression vector in muscle cells in vivo (Quantin et al (1992)). 

30 Adenoviruses (Ad) are double-stranded linear DNA viruses with a 36 kb 

genome. Several features of adenovims have made them useful as transgene 
delivery vehicles for therapeutic applications, such as facilitating in vivo gene 
delivery. Recombinant adenovirus vectors have been shown to be capable of 
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efficient in situ gene transfer to parenchymal cells of various organs, including 
the lung, brain, pancreas, gallbladder, and liver. This has allowed the use of 
these vectors in methods for treating inherited genetic diseases, such as cystic 
fibrosis, where vectors may be delivered to a target organ. In addition, the 
5 ability of the adenovirus vector to acconq)lish in situ tumor transduction has 
allowed the development of a variety of anticancer gene therapy methods for 
non-disseminated disease. In these methods, vector containment favors tumor 
cell-specific transduction. 

Like the retrovirus, the adenovirus genome is adaptable for use as an 

10 expression vector for gene therapy, x.e., by removing the genetic information that 
controls production of the virus itself (Rosenfeld et al (1991)). Because the 
adenovirus functions in an extrachromosomal fashion, the recombinant 
adenovirus does not have the theoretical problem of insertional mutagenesis. 
Several approaches traditionally have been used to generate the 

15 recombinant adenoviruses. One ^proach involves direct ligation of restriction 
endonuclease fi:agments containing a nucleic acid sequence of interest to 
portions of the adenoviral genome. Alternatively, the nucleic acid sequence of 
interest may be inserted into a defective adenovirus by homologous 
recombination results. The desired recombinants are identified by screening 

20 individual plaques generated in a lawn of complementation cells. 

Most adenovirus vectors are based on the adenovirus type 5 (Ad5) 
backbone in which an expression cassette containing the nucleic acid sequence 
of interest has been introduced in place of the early region 1 (El) or early region 
3 (E3). Viruses in which El has been deleted are defective for replication and 

25 are propagated in human complementation cells (e.^., 293 or 91 1 cells), which 
supply the missing gene El and pIX in trans. 

In one embodiment of the present invention, one will desire to generate 
siRNA in a brain cell or brain tissue. A suitable vector for this application is an 
FIV vector (Brooks et al (2002); Alisky et al (2000a)) or an AAV vector. For 

30 example, one may use AAV5 (Davidson et al (2000); Alisky et al (2000a)). 
Also, one may apply poUovirus (Bledsoe et al (2000)) or HS V vectors (AUsky 
era/. (2000b)). 



57 



wo 2004/013280 



PCT/US2003/016887 



Thus, as will be apparent to one of ordinary skill in the art, a variety of 
suitable viral expression vectors are available for transferring exogenous nucleic 
acid material into cells. The selection of an appropriate egression vector to 
express a theKq>eutic agent for a particular condition amenable to gene silencing 
5 therapy and the optimization of the conditions for insertion of the selected 

e?q)ression vector into the cell, are within the scope of one of ordinary skill in the 
art without the need for undue experimentation. 

In anotiier embodiment, the expression vector is in the form of a plasmid, 
which is transferred into the target cells by one of a variety of methods: physical 
10 (eg., microinjection (C^ecchi (1980)), electroporation (Andreason and Evans 
(1988), scrape loading, microparticle bombardment (Johnston (1990)) or by 
cellular uptake as a chemical complex (eg., calcium or strontium co- 
precipitation, complexationwith lipid, complexation with ligand) (Methods in 
Molecular Biology (1991)). Several commercial products are available for 
1 5 cationic liposome complexation including Lipofectin™ (Gibco-BRL, 
Gaitiiersburg, Md.) (Feigner et al. (1987)) and Transfectam™ (ProMega, 
Madison, Wis.) (Behr et al (1989); LoefiQer et al (1990)). However, the 
efiSciency of transfection by these methods is highly dependent on the nature of 
the target ceU and accordingly, the conditions for optimal transfection of nucleic 
20 acids into cells using tiie above-mentioned procedures must be optimized. Such 
optimization is wifliin the scope of one of ordinary skill in the art without the 
need for undue experimentatioiL 

VII, Diseases and Conditions Amendable to the Methods of the 

25 Invention 

the certain embodiments of the present invention, a mammalian 
recipient to an expression cassette of the invention has a condition that is 
amenable to gene silencing therapy. As used herein, "gene silencing therapy" 
refers to administration to the recipient exogenous nucleic acid material 

30 encoding a therapeutic siRNA and subsequent expression of the administered 
nucleic acid material in situ. Thus, the phrase "condition amenable to siRNA 
therapy** embraces conditions such as genetic diseases (z.e, a disease condition 
that is attributable to one or more gene defects), acquired pathologies (z.e., a 
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pathological condition that is not attributable to an inborn defect), cancers, 
neurodegenerative diseases, e.g., trinucleotide rq)eat disorders, and prophylactic 
processes (z.e., prevention of a disease or of an undesired medical condition). A 
gene "associated with a condition" is a gene that is either the cause, or is part of 
5 the cause, of the condition to be treated. Examples of such genes include genes 
associated with a neurodegenerative disease (e.g., a trinucleotide-repeat disease 
such as a disease associated with polyglutamine repeats, Huntington's disease, 
and sevoral spinocerebellar ataxias), and genes encoding ligands for chemokines 
involved in the migration of a cancer cells, or chemoldne receptor. Also siRNA 
10 expressed from viral vectors may be used for m vivo antiviral therapy using the 
vector systems described. 

Accordingly, as used herem, the term "therapeutic siRNA" refers to any 
siRNA that has a beneficial effect on the recipient. Thus, "therapeutic siRNA" 
embraces both therapeutic and prophylactic siRNA 
15 DiflFerences between alleles that are amenable to targeting by siRNA 

include disease-causmg mutations as well as polymorphisms that are not 
themselves mutations, but may be Imked to a mutation or associated with a 
predisposition to a disease state. Examples of targetable disease mutations 
mclude tau mutations that cause fiontotemporal dementia and the GAG deletion 
20 m the TORIA gene that causes DYTl dj^tonia. An example of a targetable 
polymorphism that is not itself a mutation is the C/G single nucleotide 
polymorphism (G987C)in the MIDI gene immediately downstream of the 
mutation that causes spinocerebellar ataxia type 3 and the polymorphism in exon 
58 associated with Huntington's disease. 
25 Single nucleotide polymorphisms comprise most of the genetic diversity 

between humans, and that many disease genes, including the HD gene in 
Huntington's disease, contain numerous single nucleotide or multiple nucleotide 
polymorphisms that could be separately targeted in one allele vs. the other, as 
shown in Figure 15 The major risk factor for developing Alzheimer's disease is 
30 the presence of a particular polymorphism in the apoUpoprotein E gene. 

A. Gene defects 
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A number of diseases caused by gene defects have been identified. For 
example, this strategy can be applied to a major class of disabling neurological 
disorders. For example this strategy can be applied to the polyglutamine 
diseases, as is demonstrated by the reduction of polyglutamine aggregation in 
5 cells following appHcation of the strategy. The neurodegenerative disease may 
be a trinucleotide-repeat disease, such as a disease associated with polyglutamine 
repeats, including Huntington's disease, and several spinocerebellar ataxias. 
Additionally, this strategy can be appUed to a non-degenerative neurological 
disorder, such as DYTl dystonia. 
10 B. Acquired pathologies 

As \ised herein, "acquired pathology" refers to a disease or syndrome 
manifested by an abnormal physiological, biochemical, cellular, structural, or 
molecular biological state. For example, the disease could be a viral disease, 
such as hepatitis or ADDS. 
15 C. Cancers 

The condition amenable to gene silencing therapy alternatively can be a 
genetic disorder or an acquired pathology that is manifested by abnormal cell 
proliferation, e.g., cancer. According to this embodiment, the instant mventioji 
is useful for silencing a gene involved in neoplastic activity. The present 
20 invention can also be used to inhibit overexpression of one or several genes. The 
present invention can be used to treat neuroblastoma, medulloblastoma, or 
glioblastoma. 

Vin. Dosages. Formulations and Routes of Administration of the 
25 Agents of the Invention 

The agents of die invention are preferably administered so as to result in 
a reduction in at least one symptom associated with a disease. The amount 
administered will vary depending on various factors including, but not hmited 
to, the composition chosen, the particular disease, the weight, the physical 
30 condition, and the age of the mammal, and whether prevention or treatment is to 
be achieved. Such factors can be readily determined by the clinician employing 
animal models or other test systems which are well known to the art. 
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Administration of siRNA may be accomplished through the 
administration of the nucleic acid molecule «icoding the siRNA (see, for 
example, Feigner et al, U.S. Patent No. 5,580,859, PardoU et al 1995; 
Stevenson et al. 1995; Moiling 1997; Donnelly et al 1995; Yang et al H; 
5 Abdallah et al 1995). Phannaceutical formulations, dosages and routes of 
administration for nucleic acids are generally disclosed, for example, in Feigner 
etdl.^siq>ra. 

The present invention envisions treating a disease, for example, a 
neurodegenerative disease, in a mammal by the administration of an agent, eg., 
10 a nucleic acid composition, an expression vector, or a viral particle of the 
invention. Administration of the therapeutic agents in accordance with the 
present invention may be continuous or intermittent, depending, for example, 
upon the recipient's physiological condition, whether the purpose of the 
administration is therapeutic or prophylactic, and other factors known to skilled 
15 practitioners. The administration of the agents of the invention may be 

essentially continuous over a preselected period of time or may be in a series of 
spaced doses. Both local and systemic administration is contemplated 

One or moire suitable unit dosage forms having the ther^eutic agent(s) of 
the invention, which, as discussed below, may optionally be formulated for 
20 sustained release (for example using microencapsulation, see WO 94/07529, and 
U.S. Patent No. 4,962,091 the disclosures of which are incorporated by reference 
herein), can be administered by a variety of routes including parenteral, 
including by intravenous and intramuscular routes, as well as by direct injection 
into the diseased tissue. For example, the therapeutic agent may be directly 
25 injected into the brain. Alternatively the therapeutic agent may be introduced 
intrathecally for brain and spmal cord conditions. In another example, the 
therapeutic agent may be introduced intramuscularly for viruses that traffic back 
to affected neurons from muscle, such as AAV, Antivirus and adenovirus. The 
formulations may, where appropriate, be conveniently presented in discrete unit 
30 dosage forms and may be prepared by any of the methods well known to 

pharmacy. Such methods may include the step of bringing into association the 
therapeutic agent with liquid carriers, solid matrices, semi-solid carriers, finely 
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divided solid carriers or combinations thereof, and then, if necessary, introducing 
or shaping the product into the desired delivery system. 

When the flierapeutic agents of the invention are prepared for 
adndnistration, they are preferably combined with a phannaceutically acceptable 
5 carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage 
form. The total active ingredients in such formulations include from 0.1 to 
99.9% by weight of the formulation. A "phannaceutically acceptable" is a 
carrier, diluent, excipient, and/or salt that is compatible with the other 
ingredients of the formulation, and not deleterious to the recipient thereof. The 

1 0 active ingredient for administration may be present as a powder or as granules; 
as a solution, a suspension or an emulsion. 

Pharmaceutical formulations containing the therapeutic agents of the 
invention can be prepared by procedures known in the art using well known and 
readily available ingredients. The therapeutic agents of the invention can also be 

15 formulated as solutions appropriate for parenteral administration, for instance by 
intramuscular, subcutaneous or intravenous routes. 

The pharmaceutical formulations of the therapeutic agents of the 
invention can also take the form of an aqueous or anhydrous solution or 
dispersion, or alternatively the form of an emulsion or suspension. 

20 Thus, the therapeutic agent may be formulated for parenteral 

administration (e.g., by injection, for example, bolus injection or continuous 
infiision) and may be presented in unit dose form in ampules, pre-filled syringes, 
small volume infusion containers or in multi-dose containers with an added 
preservative- The active ingredients may take such forms as suspensions, 

25 solutions, or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active ingredients may be in powder form, obtained by aseptic isolation of 
sterile solid or by lyophilization from solution, for constitution with a suitable 
vehicle, sterile, pyrogen-free water, before use. 

30 It will be appreciated that the imit content of active ingredient or 

ingredients contained in an individual aerosol dose of each dosage form need not 
in itself constitute an effective amount for treating the particular indication or 
disease since the necessary effective amoimt can be reached by administration of 
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a plurality of dosage units. Moreover, the efifective amount may be achieved 
using less than the dose in the dosage form, either individually, or in a series of 
administrations. 

The pharmaceutical formulations of the present invention may include, as 
5 optional ingredients, pharmaceutically acceptable carriers, diluents, solubilizing 
or emulsifying agents, and salts of the type that are well-known in the art. 
Specific non-limiting examples of the carriers and/or diluents that are usefiil in 
the pharmaceutical formulations of the present invention include water and 
physiologically acceptable buiBfered saline solutions such as phosphate buffered 
1 0 saline solutions pH 7.0-8.0. saline solutions and water. 

The invention will now be illustrated by the following non-limiting 
Example. 

Example 1 

15 siRNA-Mediated Silencing of Genes Using Viral Vectors 

In this Example, it is shown that gnes can be silenced in an allele-specific 
manner. It is also demonstrated that viral-mediated delivery of siRNA can 
specifically reduce expression of targeted genes in various cell types, both in 
vitro and in vivo. This strategy was then applied to reduce expression of a 

20 neurotoxic polyglutamine disease protein. The abihty of viral vectors to 

transduce cells efficiently in vivo, coupled with the efficacy of virally expressed 
siRNA shown here, extends the appUcation of siRNA to viral-based therapies 
and in vivo targeting experiments that aim to define the fimction of specific 
genes. 

25 

Experimental Protocols 

Generation of the expression cassettes and viral vectors. The 
modified CMV (mCMV) promoter was made by PGR amplification of CMV by 
primers 

30 5*-AAGGTACCAGATCTTAGTTATTAATAGTAATCAATTACGG-3' (SEQ 
lDNO:l)and 

5'-GAATCGATGCATGCCTCGAGACGGTTCACTAAACCAGCTCTGC-3' 

(SEQ ID NO:2) with peGFPNl plasmid (purchased firom Clontech, Inc) as 

template. The mCMV product was cloned into the Kpn/ and Clzl sites of the 
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admoviral shuttle vector pAdSKnpA, and was named pmCMVknpA. To 
construct the minimal polyA cassette, the oligonucleotides, 5 - 
CTAGAACTAGTAATAAAGGATCCTTTATTTTCATTGGAT^^ 
GGTTTTTTGTGTGCGGCCGCG-3^ (SEQ DD N0:3) and 5*- 
5 TCGACGCGGCCGCACACAAAAAACCAACACACGGATCC 

AATGAAAATAAAGGATCCTTTATTACTAGTT.3' (SEQ ID N0:4), were 
used. The oligonucleotides contain Spe/ and Sal/ sites at the 5' and 3* ends, 
respectively. The synthesized polyA cassette was ligated into Spef, Sal/ digested 
pmCMVKnpA. The resultant shuttle plasmid, pmCMVmpA was used for 
10 construction of head-to-head 21bp hairpins of eGFP (bp 418 to 438), human p- 
glucuronidase (bp 649 to 669), mouse p-glucuronidase (bp 646 to 666) or E. coli 
p-galactosidase (bp 1152-1 172). The eGFP hairpins were also cloned into the 
Ad shuttle plasmid containing the commercially available CMV promoter and 
polyA cassette from SV40 large T antigen (pCMVsiGKPx). Shuttle plasmids 
15 were co-transfected mto HEK293 cells along with the adenovirus backbones for 
generation of full-length Ad genomes. Viruses were harvested 6-10 days after 
transfection and amplified and purified as described (Anderson, R,D., et al.. 
Gene Ther. 7:1034-1038 (2000)). 

Northern blotting. Total RNA was isolated from HEK293 cells 
20 transfected by plasmids or infected by adenoviruses using TRIZOL^ileagent 
(Ihvitrogen™ Life Technologies, Carlsbad, CA) according to the manufacturer's 
instruction. RNAs (30ng) were separated by electrophoresis on 15% (wt/vol) 
polyacrylamide-urea gels to detect transcripts, or on 1% agarose-formaldehyde 
gel for target mRNAs analysis. RNAs were transferred by electroblotting onto 
25 hybond N+ membrane (Amersham Pharmacia Biotech). Blots were probed with 
^^P-labeled sense (5»-CACAAGCTGGAGTACAACTAC-3' (SEQ ID N0:5)) or 
antisense (5»-GTAOTGTACTCCAGCTTTGTG-3* (SEQ ID N0:6)) 
oligonucleotides at 37°C for 3h for evaluation of siRNA transcripts, or probed 
for target mRNAs at 42°C overnight. Blots were washed using standard 
30 methods and exposed to film overnight. In vitro studies were performed in 
triplicate with a minimum of two repeats. 

In vivo studies and tissue analyses. All animal procedures were 
approved by the University of Iowa Committee on the Care and Use of Animals. 
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Mice were injected into the taU vein (n = 10 per group) or into the brain (n = 6 
per group) as described previously (Stein, C.S., et al., J. Virol. iyMl^ZA19 
(1999)) with the virus doses indicated. Animals were sacrificed at the noted 
times and tissues harvested and sections or tissue lysates evaluated for P- 
5 glucuronidase expression, eGEP fluorescence, or P-galactosidase activity using 
established methods pQa, H. et al., Nat. Biotechnol. 19:640-644 (2001)). Total 
RNA was harvested fi-om transduced Uver using the methods described above. 

Cell Lines. PC12 tet off cell lines (Clontech Inc., Palo Alto, CA) were 
stably transfected with a tetracycUne regulatable plasmid into which was cloned 
10 GFPQ19 or GFPQ80 (Chai, Y. et al., J. Neurosci. 19:10338-10347 (1999)). For 
GEP-Q80, clones were selected and clone 29 chosen for regulatable properties 
and inclusion formation. For GFP-Q 19 clone 15 was selected for uniformity of 
GEP expression following gene expression induction. In all studies 1.5 ^g/ml 
dox was used to repress transcription. All experiments were done in triplicate 
1 5 and were repeated 4 tunes. 
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Results and Discussion 

To accomplish intracellular expression of siRNA, a 21-bp hairpin 
representing sequences directed against eGFP was constructed, and its abihty to 
reduce target gene expression in mammalian cells using two distinct constructs 
was tested. Initially, the siRNA hairpm targeted against eGFP was placed under 
Ihe control of the CMV promoter and contained a full-length SV-40 
polyadenylation (polyA) cassette (pCMVsiGFPx). hi the second construct, the 
hairpin was juxtaposed ahnost immediate to the CMV transcription start site 
(within 6 bp) and was followed by a synthetic, minimal polyA cassette (Fig. 1 A. 
pmCMVsiGFPmp A) (Experimental Protocols), because we reasoned that 
functional siRNA would require minimal to no overhangs (Caplan, N.J., et al., " 
Proc. Natl. Acad. Set U. S. A. 98:9742-9747 (2001); NykSnen, A., et al.. Cell 
107:309-321 (2001)). Co-transfection of pmCMVsiGEPmpA with pEGFPNl 
(Clontech hic) into HEK293 cells markedly reduced eGFP fluorescence (Fig. 
IC). .pr^CMVsiGFPmpA transfection led to the production of an approximately 
63 bp RNA specific for eGFP (Fig. ID), consistent with the predicted size of the 
siGFP haiipin-containing transcript. Reduction of target mRNA and eGFP 
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10 



protein expression was noted in pmCMVsiGFPmpA-transfected cells only (Fig. 
IE, F). In contrast, eGFP RNA, protein and fluorescence levels remained 
unchanged in cells transfected with pEGFPNl and pCMVsiGFPx (Fig. IE, G), 
pEGFPNl and pCMVsiBglucmpA (Fig. IE, F. H), or pEGFPNl and 
pCMVsiflgalmpA, the latter expressing siRNA against E. coli 6-galactosidase 
(Fig. IE). These data demonstrate the specificity of the expressed siRNAs. 

Constructs identical to pmCMVsiGFPmpA except that a spacer of 9, 12 
and 21 nucleotides was present between the transcription start site and the 21 bp 
hairpin were also tested. Li each case, there was no silencing of eGFP 
expression (data not shown). Together the results indicate that the spacing of the 
hairpin immediate to the promoter can be important for functional target 
reduction, a fiict supported by recent studies in MCF-7 cells (Brummelkamp, 
T.R., et aL, Science 296:550-553 (2002)). 

Recombinant adenoviruses were generated firom the siGFP 
15 .(pmCMVsiGFPmpA) and sipgluc (pmCMVsipglucmpA) plasmids (Xia, H., et 
al., Nat. Biotechnol. 19:640-644 (2001); Anderson, R.D.. et al., Gene Ther. 
7: 1034-1038 (2000)) to test the hypothesis that virally expressed siRNA aUows 
for diminished gene expression of endogenous targets in vitro and in vivo. HeLa 
cells are of human origin and contain moderate levels of the soluble lysosomal 
20 enzyme p-glucuronidase. hifection of HeLa cells with viruses expressing 

sipgluc caused a specific reduction in human fi-glucuionidase mRNA (Fig. \J) 
leading to a 60% decrease in p-glucuronidase activity relative to siGFP or 
control cells (Fig 1 J). Optimization of siRNA sequences using methods to refine 
target mRNA accessible sequences (Lee, N.S., et al.. Nat. Biotechnol. 19:500- 
505 (2002)) could improve fiirther the diminution of 8-glucuronidase transcript 
and protein levels. 

The results in Fig. 1 are consistent with earUer work demonstrating the 
ability of synthetic 21-bp double stranded RNAs to reduce expression of target 
genes in mammalian cells foUowing b^fection, with the important difference 
tiiat in tiie present stiidies tiie siRNA was syntiiesized intraceUularly fi'om readily 
available.promoter constructs. The data support tiie utiUty of regulatable, tissue 
or ceU-specific promoters for expression of siRNA when suitably modified for 
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close juxtaposition of the hairpin to the transcriptional start site and inclusion of 
the minimal polyA sequence containing cassette (^ee, Methods above). 

To evaluate the abiUty of virally expressed siRNA to diminish target- 
gene expression in adult mouse tissues in vivo, transgenic mice expressing eGFP 
5 (Okabe, M. et al., FEES Lett 407:313-319 (1997)) were injected into the striatal 
region of the brain with 1x10^ infectious units of recombinant adenovirus 
vectors expressing siGFP or control sipgluc. Viruses also contained a dsRed 
expression cassette in a distant region of the virus for unequivocal localization of 
the injection site. Brain sections evaluated 5 days after injection by fluorescence 

10 (Fig. 2A) or westem blot assay (Fig. 2B) demonstrated reduced eGFP 
expression- Decreased eGFP expression was confined to the injected 
hemisphere (Fig. 2B). The in vivo reduction is promising, particxilarly since 
transgenically expressed eGFP is a stable protein, making complete reduction in 
this short time fi-ame unlikely. Moreover, evaluation of eGFP levels was done 5 

1 5 days after injection, when inflanamatory changes induced by the adenovirus 
vector likely enhance transgenic eGFP expression from the CMV enhancer 
(Ooboshi, H., et al, Arterioscler Thromb. Vase, Biol 17:1786-1792 (1997)). 

It was next tested whether virus mediated siRNA could decrease 
expression from endogenous alleles in vivo. Its ability to decrease P- 

20 glucuronidase activity in the murine Uver, where endogenous levels of this 

relatively stable protem are high, was evaluated. Mice were injected via the tail 
vein with a construct expressing murine-specific siPgluc (AdsiMupgluc), or the 
control viruses Adsipgluc (specific for human p-glucuronidase) or Adsipgal. 
Adenoviruses injected into the tail vein transduced hqiatocytes as shown 

25 previously (Stein, C.S., et al., 1 Virol, 73:3424-3429 (1999)). Liver tissue 

harvested 3 days later showed specific reduction of target 6-glucuronidase RNA 
in AdsiMuBgluc treated mice only (Fig. 2C). Fluorometric enzyme assay of 
liver lysates confirmed these results, with a 12% decrease in activity from liver 
harvested from AdsiMuPgluc injected mice relative to Adsipgal and AdsiPgJuc 

30 treated ones (p<0.01; n=10). Interestingly, sequence differences between the 
murine and human siKNfA constmcts are limited, with 14 of 21 bp being 
identical. These results confirm flie specificity of virus mediated siRNA, and 
indicate that allele-specific applications are possible. Together, flie data are the 
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first to demonstrate the utility of siRNA to diminish target gene expression in 
brain and liver tissue in vivo, and establish that allele-specific silencing in vivo is 
possible with siRNA. 

One powerful therapeutic application of siRNA is to reduce expression of 
5 toxic gene products in dominantly inherited diseases such as the polyglutamine 
(polyQ) neurodegenerative disorders (Margolis, KL. & Ross, C. A. Trends Mol 
Med. 7:479-482 (2001)). The molecular basis of polyQ diseases is a novel toxic 
property conferred upon the mutant protein by polyQ expansion. This toxic 
property is associated wifli disease protein aggregation. The ability of viially 
1 0 e3q)ressed siRNA to diminish e)q>anded polyQ protein expression in neural PC- 
12 clonal cell Unes was evaluated. Lines were developed that express 
tetracycline-repressible eGFP-polyglutamine fusion proteins with normal or 
expanded glutamine of 19 (eGFP-Q19) and 80 (eGFP-Q80) repeats, respectively. 
Differentiated, eGFP-Q19-expressing PC12 neural cells infected with 
1 5 recombinant adenovirus expressing siGFP demonstrated a spiecific and dose- 
dependent decrease in eGFP-Q19 fluorescence (Fig. 3A, C) and protein levels 
(Fig. 3B). Application of Adsipgluc as a control had no effect (Fig, 3A-C). 
Quantitative image analysis of eGFP fluorescence demonstrated that siGFP 
reduced GFPQ19 expression by greater than 96% and 93% for 100 and 50 MOI 
20 respectively, relative to control siEGSf A (Fig. 3C). The multiplicity of infection 
(MOI) of 100 required to achieve maximal inhibition of eGFP-Q19 expression 
results largely fix)m the inability of PC12 ceDs to be mfected by adenovinis- 
based vectors. This barrier can be overcome using AAV- or lentivirus-based 
expression systems (Davidson, B.L., et al., Proa Natl Acad, Sci. U, S. A. 
25 97:3428-3432 (2000); Brooks, A.L, et al, Proc. Natl Acad Sal U, S. A, 
" 99:6216-6221(2002)). 

To test the impact of siRNA on the size and number of aggregates 
formed in eGFP-Q80 expressing cells, differentiated PC-12/eGFP-Q80 neural 
cells were infected with AdsiGFP or Adsipgluc 3 days after doxycycline 
30 removal to induce GFP-Q80 expression. Cells were evaluated 3 days later. In 
mock-infected control cells (Fig. 4A), aggregates were very large 6 days after 
induction as reported by others (Chai, Y., et al., J, Neurosci. 19:10338-10347 
(1999; Moulder, K.L., et al., 1 Neurosci. 19:705-715 (1999)). Large aggregates 
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were also seen in cells infected with AdsiPgluc (Fig. 4B), AdsiGFPx, (Fig. 4C, 
siRNA expressed from the normal CMV promoter and containing the SV40 
large T antigen polyadenylation cassette), or Adsipgal (Fig. 4D). In contrast, 
polyQ aggregate fonnation was significantly reduced in AdsiGFP infected cells 
5 (Fig. 4E), with fewer and smaller inclusions and more difiuse eGFP 

fluorescence. AdsiGFP-mediated reduction in aggregated and monomeric GFP- 
Q80 was verified by Westem blot analysis (Fig. 4F), and quantitation of cellular 
fluorescence (Fig. 4G). AdsiGFP caused a dramatic and specific, dose- 
dependent reduction in eGFP-Q80 expression (Fig. 4F, G). 
10 It was found that transcripts expressed from the modified CMV promoter 

and containing the minimal polyA cassette were capable of reducing gene 
expression in both plasmid and viral vector systems (Figs. 1 -4). The placement 
of the hairpin immediate to the transcription start site and use of the minimal 
polyadenylation cassette was of critical importance. In plants and Drosophila, 
15 RNA interference is initiated by the ATP-dependent, processive cleavage of long 
dsRNA into 21-25 bp double-stranded siRNA, followed by incorporation of 
siKNA into a RNA-induced silencing complex that recognizes and cleaves the 
target (Nykanen, A., et al.. Cell 107:309-321 (2001); Zamore, PD., et al. Cell 
101:25-33 (2000); Bernstein, E., et al., Nature 409:363-366 (2001); Hamilton, 
20 A,J. & Baulcombe, D.C. Science 286:950-952 (1999); Hammond, S.M. et al.. 
Nature 404:293-296 (2000)). Viral vectors expressing siRNA are usefiil in 
determining if similar mechanisms are involved in target RNA cleavage in 
mammaUan ceUs in vivo. 

In summary, these data demonstrate that siRNA expressed from viral 
25 vectors in vitro and in vivo specifically reduce expression of stably expressed 
plasmids in cells, and endogenous transgenic targets in mice. Importantly, the 
^plication of virally expressed siRNA to various target alleles in different cells 
and tissues in vitro and in vivo was demonstrated. Finally, the results show that 
it is possible to reduce polyglutamine protein levels in neurons, which is the 
30 cause of at least nine inherited neurodegenerative diseases, with a corresponding 
decrease in disease protein aggregation. The abiUty of viral vectors based on 
adeno-associated virus (Davidson, B.L., et al., Proc, Natl Acad. ScL U, S. A, 
97:3428-3432 (2000)) and lentiviruses (Brooks, A.L, et al., Proc. Natl Acad 
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Sd. U. S. A. 99:6216-6221 (2002)) to efficiently transduce cells in the CNS, 
coupled with the effectiveness of virally-expressed siKNA demonstrated here, 
extends the ^plication of siRNA to viral-based thaapies and to basic researeh, 
including inhibiting novel ESTs to define grate function. 

5 

Example 2 

siRNA Sunnresinn nf Genes Involved in MJD/Sr A3 and FmP-l? 

Modulation of gene expression by endogenous, noncoding RNAs is 
10 increasingly appreciated to play a role in eukaryotic development, maintenance 
of chromatin structure and genomic mtegrity. Recently, techniques have been 
developed to trigger RNA interference (RNAi) against specific targets in 
mammalian cells by introducing exogenously produced or intracellularly 
expressed siRNAs. These methods have proven to be quick, inexpensive and 
15 effective for knockdown experiments i/ivf/ro and wWvo. The ability to 

accompUsh selective gene silencing has led to the hypothesis that siRNAs might 
be employed to suppress gene expression for therapeutic benefit. 

Dominantly inherited diseases are ideal candidates for siRNA-based 
ther^y. To explore the utiUty of siRNA in inherited human disorders, the 
20 inventors employed cellular models to test whether we could target mutant 
alleles causing two classes of dominantly inherited, untreatable 
neurodegenerative diseases: polyglutamine (polyQ) neurodegeneration in 
MJD/SCA3 and fixmtotemporal dementia with parkinsonism linked to 
chromosome 17 (FTDP-17). The polyQ neurodegenerative disorders consist of 
25 at least nine diseases caused by CAG repeat expansions that encode polyQ in the 
disease protein. PolyQ expansion confers a dominant toxic property on the 
mutant protein that is associated with aberrant accumulation of the disease 
protein in neurons. In FTDP-17, Tau mutations lead to the fonnation of 
neurofibrillary tangles accony)anied by neuronal dysfunction and degeneratioa 
30 The precise mechanisms by which these mutant proteins cause neuronal injury 
are unknown, but considerable evidence suggests that the abnormal proteins 
themselves initiate (he pathogenic process. Accordingly, eUminatbg expression 
of the mutant protein by siRNA or other means should, in principle, slow or even 
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prevent disease. However, because many dominant disease genes may also 
encode essential proteins, the inventors sought to develop siRNA-mediated 
approaches that selectively inactivate mutant alleles while allowing contimied 
expression of the wild type protem. 

5 

Methods 

siRNA Synthesis. In vitro siKNA synthesis was previously described 
(Donze 2000). Reactions were performed with desalted DNA oUgonucleotides 
(IDT Coralville, lA) and the AmpliScribeT? High Yield Transcription Kit 

10 (Epicentre Madison, WT). Yield was detemiined by absoibance at 260nm. 
Annealed siRNAs were assessed for double stranded character by agarose gel 
(1% w/v) electrophoresis and ethidimn bromide staining. Note that for all 
siRNAs generated in this study the most 5' nucleotide in the targeted cDNA 
sequence is referred to as position 1 and each subsequent nucleotide is numbered 

1 5 in ascending order from 5' to 3'. 

Plasmid Construction. The human ataxin-3 cDNA was expanded to 166 
CAG's by PGR (Laccone 1999). PGR products were digested at BamHI and 
Kpnl sites introduced during PGR and ligated into BgUI and Kpnl sites of 
pEGFP-Nl (Clontech) resulting in full-length expanded ataxin-3 fiised to the N- 

20 terminus of EGFP. Untagged Ataxin-3-Q166 was constmcted by ligating a 

PpuMI-NotI ataxin-3 fragment (3' of the GAG repeat) into Ataxin-3-Q166-GFP 
cut with PpuMI and NotI to remove EGFP and replace the normal ataxin-3 stop 
codon. Ataxin-3-Q28-GFP was generated as above from pcDNA3.1-ataxin-3- 
Q28. Gonstructs were sequence verified to ensure that no PGR mutations were 

25 present. Expression was verified by Western blot with anti-ataxin-3 (Paulson 
1997) and GFP antibodies (MBL). The construct encoding a flag tagged, 352 
residue tau isoform was previously described (Leger 1994), The pEGFP-tau 
plasmid was constmcted by ligating the human tau cDNA into pEGFP-G2 
(Glontech) and encodes tau with EGFP fused to the amino terminus. The 

30 pEGFP-tauV3 37M plasmid was derived using site-directed mutagenesis 
(QuikGhange Kit, Stratagene) of the pEFGP-tau plasmid. 

Cell Culture and Transfections. Culture of Gos-7 and HeLa cells has 
been described (Chai 1999b). Transfections with plasmids and siRNA were 
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performed using Lipofectamine Plus (LifeTechnologies) according to the 
manufacturer's instructions. For ataxin-3 expression 1.5 fig plasmid was 
transfected with 5jig in vitro synthesized siRNAs. For Tau experiments Ijig 
plasmid was transfected with 2.5Hg siRNA. For expression of hairpin siKNA 
5 from the phU6 constructs, 1 ixg ataxin-3 expression plasmid was transfected with 
4\ig phU6-siC10i or phU6-siG10i. Cos-7 cells infected with siRNA-expressing 
adenovirus were transfected with 0.5^g of each expression plasmid. 

Stably transfected, doxycycline-inducible cell lines were generated in a 
subclone of PC12 cells, PC6-3, because of its strong neural diOFerentiation 
10 properties (Pittman 19938). A PC6-3 clone stably expressing Tet repressor 
plasmid (provided by S, Strack, Univ. of Iowa), was transfected with 
pcDNA5/TO-ataxin-3(Q28) orpcDNA5/TO-ataxin-3(Q166) (Invitrogen). After 
selection in hygromycin, clones were characterized by Western blot and 
immunofluorescence. Two clones, PC6-3-ataxin3(Q28)#33 and PC6-3- 
15 ataxin3(Q166)#41, were chosen because of their tightly inducible, robust 
expression of ataxin-3. 

siRNA Plasmid and Viral Production. Plasmids expressing ataxin-3 
shRNAs were generated by insertion of head-to-head 21 bp hairpins in phU6 that 
corresponded to siClO and siGlO (Xia 2002). 
20 Recombinant adenovirus expressing ataxin-3 specific shRNA were 

generated from phU6-C10i (encoding CIO hairpin siRNA) and phU6si-G10i 
(encoding GIO hairpin siRNA) as previously described (Xia 2002, Adnerson 
2000). 

Western Blotting and Immunofluorescence. Cos-7 cells expressing 
25 ataxm-3 were harvested 24-48 hours after transfection (Chai 1999b). Stably 
transfected, inducible cell Unes were harvested 72 hours after infection with 
adenovirus. Lysates were assessed for ataxin-3 expression by Westem blot 
analysis as previously described (Chai 1999b), using polyclonal rabbit anti- 
ataxin-3 aQtisera at a 1:15,000 dilution or 1C2 antibody specific for expanded 
30 polyQ tracts (Trottier 1 995) at a 1 :2,500 dilution. CeUs expressing Tau were 
harvested 24 hours after transfection. Protein was detected with an affinity 
purified polyclonal antibody to a human tau peptide (residues 12-24) at a 1 :500 
dilution. Anti-alpha-tubulin mouse monoclonal antibody (Sigma St. Louis, MO) 

72 



wo 2004/013280 



PCT/US2003/016887 



was used at a 1: 10,000 dilution and GAPDH mouse monoclonal antibody 
(Sigma St. Louis, MO) was used at a 1:1,000 dilution. 

Immunofluorescence for ataxin-3 (Chai 1999b) was carried out using 
1C2 antibody (Chemicon Intemational Temecula, CA) at 1 : 1,000 dilution 48 
5 hours after transfection. Flag-tagged, wild type tau was detected using mouse 
monoclonal antibody (Sigma St Louis, MO) at 1:1,000 dilution 24 hours after 
transfection. Botti proteins were detected with rhodamine conjugated secondary 
antibody at a 1:1,000 dilution. 

Fluorescent Imaging and Quantification. Fixed samples were observed 
10 with a Zeiss Axioplan fluorescmce microscope. Digital images were collected 
on separate red, green and blue fluorescence channels using a SPOT digital 
camera. Images were assembled and overlaid using Adobe Photoshop 6.0. Live 
cell images were collected witii a Kodak MDS 290 digital camera mounted to an 
Olympus (Tokyo, Japan) CK40 inverted microscope. Fluorescence was 

1 5 quantitated by collecting 3 non-overlapping images per well at low power (1 Ox). 
Pixel count and intensity for each image was determined using Bioquant Nova 
Prime software (BIOQUANT Image Analysis Corporation). Background was 
subtracted by quantitation of images from cells of equivalent density under 
identical fluorescent illumination. Mock transfected cells were used to assess 

20 background fluorescence for all experiments and were stained witii appropriate 
primary and secondary antibodies for simulated heterozygous experiments. 
Average fluorescence is reported from 2 to 3 independent experiments. The 
mean of 2 to 3 independent experiments for cells transfected with the indicated 
expression plasmid and siMiss was set at one. Errors bars depict variation 

25 between experiments as standard error of the mean. In simulated heterozygous 
experiments, a blinded observer scored cells with a positive fluorescence signal 
for expression of wild type, mutant or both proteins in random fields at high 
power for two independent experiments. More than 100 cells were scored in 
each experiment and reported as number of cells witii co-expression divided by 

30 total niunber of transfected cells. 
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Results 

Direct Silencing of Expanded Alleles. The inventors first attempted 
suppression of mutant polyQ expression using siRNA complementary to the 
CAG repeat and immediately adjacent sequences to determine if the expanded 
5 repeat differentially altered the susceptibility of the mutant allele to siRNA 
inhibition (Figure 6). HeLa cells were transfected with various in vitro 
synthesized siRNAs (Danze 2002) and plasmids encoding normal or expanded 
polyQ fused to red or green fluorescent protein, respectively (Q19-RFP and 
Q80-GFP) (Fig. 5a). In negative control cells transfected with Q80-GFP, Q19- 
10 RFP and a mistargeted siRNA (siMiss), Q80-GFP formed aggregates (Onodera 
1997) which recruited the normally diffuse Q19-RFP (Fig 5a). When the 
experiment was performed with siRNA targeted to GFP as a positive control for 
allele specific silencing, Q80-GFP expression was nearly abolished while Q19- 
RFP continued to be expressed as a diffusely distributed protem (Fig. 5a). When. 
15 Q19-RFP and Q80-GFP were co-transfected with siRNA directly targeting the 
CAG repeat (siCAG) (Fig. 5a) or an immediately adjacent 5' region (data not 
shown), expression of both proteins was efficiently suppressed. 

To test whether siRNA could selectively silence expression of a full- 
length polyQ disease protein, siRNAs were designed that target the transcript 
20 encoding ataxin-3, the disease protein in Machado- Joseph Disease, also known 
as Spinocerebellar Ataxia Type 3 (MJD/SCA3) (Zoghbi 2000) (Fig. 5b). In 
transfected cells, siRNA directed against three separate regions - the CAG 
repeat, a distant 5' site, or a site just 5* to the CAG repeat (siN'CAG) - resulted 
in efficient, but not allele-specific, suppression of ataxin-3 containing normal or 
25 expanded repeats (data not shown). Consistent with an earlier study using longer 
dsRNA (Caplen 2002) the present results show that expanded CAG repeats and 
adjacent sequences, while accessible to RNAi, may not be preferential targets for 
silencing. 

Allele-specific SUencing of the Mutant PolyQ Gene in MJD/SCA3. In 

30 further efforts to selectively inactivate the mutant allele the inventors took 

advantage of a SNP in the MJDl gene, a G to C transition immediately 3* to the 
CAG repeat (G987C) (Fig. 5b). This SNP is in linkage disequaibrimn witii the 
disease-causing expansion, in most families segregating perfectly with the 
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disease aUele. Worldwide, 70% of disease chromosomes cany the C variant 
(Caspar 2001). The present ataxin-3 expression cassettes, which were generated 
from patients (Paulson 1997), contain the C variant in all expanded ataxin-3 
constructs and the G variant in all nomial ataxin-3 constructs. To test whether 
5 this G-C mismatch could be distinguished by siRNA, siRNAs were designed that 
included the last 2 CAG triplets of the repeat followed by the C variant at 
position 7 (siC7) (Figure 6 and Fig. 5b), resulting in a perfect match only for 
expanded alleles. Despite the presence of a single mismatch to the wild type 
allele, siC7 strongly mhibited expression of both alleles (Fig. 5c,d). A second G- 
10 C mismatch was then introduced at position 8 such that the siRNA contained 
two mismatches as compared to wild type and only one mismatch as compared 
to mutant alleles (siC7/8). The siC7/8 siRNA effectively suppressed mutant 
ataxin-3 expression, reducing total fluorescence to an average 8.6% of control 
levels, with only modest effects on wild type ataxin-3 (average 75.2% of 
15 control). siC7/8 also nearly eliminated the accumulation of aggregated mutant 
ataxin-3, a pa&ological haUmaik of disease (Chan 2000) (Fig. 5d). 

To optimize differential suppression, siRNAs were designed containing a 
more centrally placed mismatch. Because the center of the antisense strand 
directs cleavage of target mRNA m the RNA Induced Silencing Complex 
20 (RISOcomplex (Elbashir 2001c), it was reasoned that CMitral mismatches might 
more efiBdentty discriminate between wild type and mutant alleles. siRNAs 
were designed that place the C of the SNP at position 10 (siClO). preceded by 
the final three triplets in the CAG repeat (Figure 6 and Fig. 5b). In transfected 
cells, siClO caused aUele-specific suppression of the mutant protein (Fig. 5c,d). 
25 Fluorescence from expanded Atx-3-Q166-GFP was dramatically reduced (7.4% 
of control levels), while fluorescence of Atx-3-Q28-GFP showed minimal 
change (93.6% of control; Fig. 5c.d). Conversely, siRNA engineered to suppress 
only the wild type allele (siGlO) inhibited wild type expression with Uttle effect 
on expression of the mutant allele (Fig. 5c,d). Inclusion of three CAG repeats at 
30 the 5" end of the siRNA did not inhibit e>q)ression of Q19-GFP, Q80-GFP, or 
full-length ataxin-l-Q30 proteins that are each encoded by CAG repeat 
containing transcripts (Fig. 7). 
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In fhe disease state, normal and mutant alleles are simultaneously 
expressed, la plants and wonns, activation of BNAi against one transcript results 
in the spread of silencing signals to oflier targets due to RNA-dq)endent KNA 
polymerase (RDRP) activity primed by the introduced RNA (Fire 1998, Tang 
5 2003). Although spreading has not been detected in mammalian cells and RDKP 
activity is not required for eflEective siRNA inhibition (Chiu 2002, Schwarz 
2002, Martinez 2002), most studies have used cell-free systems in which a 
mammalian RDRP could have been inactivated. If triggering the mammalian 
RNAi pathway against one allele activates cellular mechanisms that also silence 

10 the other allele, then siRNA ^phcations might be limited to non-essential genes. 
To test this possibility, the heterozygous state was simulated by co-transfecting 
Atx-3-Q28-GFP and Atx-3-Q166 and analyzing suppression by Western blot. As 
shown in Fig. 5e each siRNA retained the specificity observed in separate 
transfections: siC7 inhibited both alleles, siGlO inhibited only the wild type 

1 5 allele, and siC7/8 and siCl 0 inhibited only mutant allele expression. 

Effective siRNA therapy for late onset disease will likely require 
sustained intracellular expression of the siRNA. Accordingly, the present 
experiments were extended to two intracellular methods of siRNA production 
and delivery: expression plasmids and recombinant virus (Brummelkamp 2002, 

20 Xia 2002). Plasmids were constructed expressing siGlO or siClO siRNA from 
the human U6 promoter as a hairpin transcript that is processed intracellularly to 
produce siRNA (Brummelkamp 2002, Xia 2002). When co-transfected with 
ataxin-3-GFP expression plasmids, phU6-G10i andphU6-C10i-siRNA plasmids 
specifically suppressed wild type or mutant ataxin-3 expression, respectively 

25 (Fig. 5f). 

This result encouraged the inventors to engineer recombinant adenoviral 
vectors expressing allele-specific siRNA (Xia 2002). Viral-mediated 
suppression was tested in Cos-7 cells transiently transfected with both Atx-3- 
Q28-GFP and Atx-3-Q166 to simulate the heterozygous state. Cos-7 cells 
30 infected with adenovirus encoding siGlO, siClO or negative control siRNA (Ad- 
GlOi, Ad-ClOi, and Ad-LacZi respectively) exhibited allele-specific silencing of 
wild type ataxin-3 expression with Ad-GlOi and of mutant ataxin-3 with Ad- 
ClOi (Fig 8a,b,c). Quantitation of fluorescence (Fig. 8b) showed that Ad-GlOi 
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reduced wfld type ataxin-3 to 5.4% of control levels while mutant ataxin-3 
expression remained unchanged. Conversely, Ad-ClOi reduced mutant ataxin-3 
fluorescence levels to 8.8% of control and retained 97.4% of wild type signal. 
These results were confirmed by Western blot where it was further observed that 
5 Ad-GlOi virus decreased endogenous (primate) ataxin-3 while Ad-ClOi did not 
(Fig 8c). 

Viral mediated suppression was also assessed in differentiated PC12 
neural cell lines that inducibly express normal (Q28) or expanded (Q166) mutant 
ataxm-3. Following infection with Ad-GlOi, Ad-ClOi, or Ad-LacZi, 

10 differentiated neural cells were placed in doxycycline for three days to induce 
maximal expression of ataxin-3. Western blot analysis of cell lysates confirmed 
that the Ad-GlOi virus siq)pressed only wild type ataxin-3, Ad-ClOi virus 
suppressed only mutant ataxin-3, and Ad-LacZi had no effect on either normal or 
mutant ataxin-3 expression (Fig. 8d). Thus, siRNA retains its eflScacy and 

1 5 selectivity across different modes of production and delivery to achieve allele- 
specific silencingof ataxin-3. 

AUele-Specific Silencing of a Missense Tau Mutation. The preceding 
results indicate that, for DNA repeat mutations in v^ch the repeat itself does not 
present an effective target, an associated SNP can be exploited to achieve allele- 

20 specific silencing. To test whether siRNA works equally well to silence disease- 
causing mutations directly, the inventors targeted missense Tau mutations that 
cause FTDP-17 (Poorkaj 1998, Hutton 1998). A series of 21-24 nt siRNAs were 
generated in vitro against four missense FTDP-17 mutations: G272V, P301L, 
V337M, and R406W (Figure 6 and Fig 9a). In each case the point mutation was 

25 placed centrally, near the likely cleavage site in the RISC complex (position 9, 
10 or 1 1) (Laccone 1999). A fiflh siRNA designed to target a 5' sequence in all 
Tau transcripts was also tested. To screen for siKNA-mediated suppression, the 
inventors co-transfected GFP fusions of mutant and wild type Tau isoforms 
together with siRNA into Cos-7 cells. Of the five targeted sites, the inventors 

30 obtained robust suppression with siRNA corresponding to V337M (Figure 6 and 
Fig. 9A) (Poorkaj 1998, Hutton 1998), and thus focused finther analysis on this 
mutation. The V337M mutation is a G to A base change in the first position of 
the codon (GTG to ATG), and the corresponding V337M siRNA contains the A 
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missense change at position 9 (siA9). This intended V337M-specific siRNA 
preferentially silenced the mutant allele but also caused significant suppression 
of wild type Tau (Fig. 9b,c). 

Based on the success of this approach with ataxin-3, the inventors 
5 designed two additional siSNAs that contained the V337M (G to A) mutation at 
. position 9 as well as a second introduced G-C mismatch immediately S' to the 
mutation (siA9/C8) or three nucleotides 3* to the mutation (siA9/C12), such that 
the siRNA now contained two mismatches to the wild type but only one to the 
mutant allele. This strategy resulted in further preferential inactivation of the 

10 mutant allele. One siRNA, siA9/C12, showed strong selectivity for the mutant 
tau allele, reducing fluorescence to 12.7% of control levels without detectable 
loss of wild type Tau (Fig. 9b,c). Next, we simulated the hdterozygous state by 
co-transfecting V337M-GFP and flag-tagged WT-Tau expression plasmids (Fig. 
10). In co-transfected HeLa cells, siA9/C12 silenced the mutant allele (16.7% of 

1 5 control levels) with minimal alteration of wild type expression assessed by 
fluorescence (Fig. 10a) and Western blot (Fig. 10b). In addition, siA9 and 
siA9/C8 displayed better allele discrimination than we had observed in separate 
transfections, but continued to suppress both wild type and mutant tau 
expression (Fig. 10a,b,c). 

20 

Discussion 

Despite the rapidly growing siRNA literature, questions remain 
concerning the design and application of siRNA both as a research tool and a 
therapeutic strategy. The present study, demonstrating allele-specific silencing of 

25 dominant disease genes, sheds light on important aspects of both applications. 

Because many disease genes encode essential proteins, development of 
strategies to exclusively inactivate mutant alleles is important for the general 
application of siRNA to dominant diseases. The present results for two unrelated 
disease genes demonstrate that in mammalian cells it is possible to silence a 

30 single disease allele without activating pathways analogous to those found in 
plants and worms that result in the spread of silencing signals (Fire 1998, Tang 
2003), 
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In summary, siRNA can be engineered to silence expression of disease 
alleles differing from wild type alleles by as little as a single nucleotide. This 
approach can directly target missense mutations, as in frontotemporal dementia, 
or associated SNPs, as in MJD/SCA3. The present stepwise strategy for 
5 optimizing allele-specific targeting extends the utility of siKNA to a wide range 
of dominant diseases in which the disease gene normally plays an important or 
essential role. One such example is the polyglutamine disease, Huntington 
disease (HD), in which normal HD protein levels are developmentally essential 
(Nasir 1995). The availability of mouse models for many dominant disorders, 
10 including MJD/SCA3 (Cemal 2002), HD (Lm 2001), and FTDP-l? (Tanemura 
2002), allows for the in ^nvo testing of siRNA-based therapy for these and other 
human diseases. 



Example 3 

15 Therapy for DYTl dystonia: Allele-specific silencing of mutant 

TorsinA 

DYTl dystonia is the most common cause of primary generalized 
dystonia. A dominantly inherited disorder, DYTl usually presents in childhood 
as focal dystonia that progresses to severe generalized disease. With one possible 

20 exception, all cases of DYTl result fiom a common GAG deletion in TORI A, 
eliminating one of two adjacent glutamic acids near the C-terminus of the 
protem TorsinA (TA). Although the precise cellular function of TA is unknown, 
it seems clear that mutant TA (TAmut) acts througji a dominant-negative or 
dominant-toxic mechanism. The dominant nature of the genetic defect in DYTl 

25 dystonia suggests that eflforts to silence e3q)ression of TAmut should have 
potential therapeutic benefit. 

Several characteristics of DYTl make it an ideal disease in which to 
explore siRNA-mediated gene silencing as potential therapy. Of greatest 
importance, the dominant nature of the disease suggests that a reduction in 

30 mutant TA, whatever the precise pathogenic mechanism proves to be, will be 
helpful. Moreover, the existence of a single common mutation that deletes a full 
three nucleotides suggests it may be feasible to design siKNA that will 
specifically target the mutant allele and will be applicable to all affected persons. 
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Finally, there is no effective therapy for DYTl , a relentless and disabling 
disease. Thus, any therapeutic ^preach with promise needs to be explored. 
Because TAwt may be an essential protein, however, it is critically important 
that efforts be made to silence only the mutant allele. 
5 . In the studies reported here , the inventors explored the utility of siRNA 

for DYTl . As outlined in the strategy m Figure 1 1 , the inventors sought to 
develop siRNA that would specifically eliminate production of protein from the 
mutant allele. By exploiting the three base pair difference between wild type 
and mutant alleles, the mventors successfully silenced expression of TAmut 
10 withput interfering with expression of the wild type protein (TAwt). 

Methods 

siRNA design and synthesis Small-interfering RNA duplexes were 
synthesized in vitro according to a previously described protocol (Donze 2002), 

15 using AmpliScribeT? High Yield Transcription Kit (Epicentre Technologies) 
and desalted DNA oligonucleotides (IDT). siRNAs were designed to target 
different regions of human TA transcript: 1) an upstream sequence common to 
both TAwt and TAmut (com-siKNA); 2) the area corresponding to the mutation 
with either the wild type sequence (wt-siRNA) or the mutant sequence 

20 positioned at three different places (mutA-siRNA, mutB-siRNA, mutC-siRNA); 
and 3) a negative control siRNA containing an irrelevant sequence that does not 
target any region of TA (mis-siRNA). The design of the primers and targeted 
sequences are shown schematically in Figure 12. After in vitro synthesis, the 
double stranded structure of the resultant RNA was confirmed in 1.5 % agarose 

25 gels and RNA concentration determined with a SmartSpect 3000 UV 
Spectrophotometer (BioRad). 

Plasmids pcDNA3 containing TAwt or TAmut cDNA were kindly 
provided by Xandra Breakefield (Mass General Hospital, Boston, MA). This 
construct was produced by cloning the entire coding sequences of human 

30 TorsinA (1-332), both wild-type and mutant (GAG deleted), into the mammalian 
expression vector, pcDNA3 (Clontech, Palo Alto, CA). Using PGR based 
strategies, an N-terrainal hemagglutinin (HA) epitope tag was inserted into both 
constructs. pEGFP-C3-TAwt was kindly provided by PuUanipally Shashidharan 
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(Mt Sinai Medical School, NY). This construct was made by inserting the full- 
length coding sequence of wild-type TorsinA into the EcoRI and BamHI 
restriction sites of the vector pEGFP-C3 (Clontech). This resulted in a fusion 
protein including eGFP, three "stuffer" amino acids and the 331 amino acids of 
5 TorsinA. HA-tagged TAmut was inserted into the Apal and Sail restriction sites 
of pEGFP-Cl vector (Clontech), resulting in a GFP-HA-TAmut construct. 

Cell culture and transfections Methods for cell culture of Cos-7 have 
been described previously (Chai 1999b). Transfections with DNA plasmids and 
siRNA were performed using Lipofectamine Plus (LifeTechnologies) according 

10 to the manufacturer's instructions in six or 12 well plates with cells at 70-90% 
confluence. For single plasmid transfection, 1 ng of plasmid was transfected 
with 5ng of siRNA. For double plasmid transfection, 0.75 |ig of each plasmid 
was transfected with 3.75 jig of siRNA. 

Western Blottmg and Fluorescence Microscopy. Cells were harvested 

15 36 to 48 hours after transfection and lysates were assessed for TA expression by 
Western Blot analysis (WB) as previously described (Chai 1999b). The antibody 
used to detect TA was polyclonal rabbit antiserum generated against a TA- 
maltose binding protein fusion protein (kindly provided by Xandra Breakefield) 
at a 1 :500 dilution. Additional antibodies used in the experiments described here 

20 are the anti-HA mouse monoclonal antibody 12CA5 (Roche) at 1 : 1 ,000 dilution, 
monoclonal mouse anti-GFP antibody (MBL) at 1:1,000 dilution, and for 
loading controls, anti a-tubulin mouse monoclonal antibody (Sigma) at 1 :20,000 
dilution. 

Fluorescence visualization of fixed cells expressing GFP-tagged TA was 
25 performed with a Zeiss Axioplan fluorescence microscope. Nuclei were 

visualized by staining with S^g/ml DAPI at room temperature for 10 minutes. 
Digital images were collected on separate red, grera and blue fluorescence 
channels using a Diagnostics SPOT digital camera. live cell images were 
collected with a Kodak MDS 290 distal camera mounted on an 01ynq)us CK40 
30 inverted noicroscope equipped for GFP fluorescence and phase contrast 
microscopy. Digitized images were assembled using Adobe Photoshop 6.0. 

Western Blot and Fluorescence Quantification. For quantification of 
WB signal, blots were scanned with a Hewlett Packard ScanJet SIOOC scanner. 
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The pixel count and intensity of bands correspondiag to TA and a-tubulin were 
measured and the background signal subtracted using Scion Image software 
(Scion Corporation). Using the a-tubulin signal from control lanes as an iutemal 
reference, the TA signals were normalized based on the amount of protein 
5 loaded per lane and the result was expressed as percentage of TA signal in the 
control lane. Fluorescence quantification was detemiined by collecting three 
non-overlapping images per well at low power (lOx), and assessing the pixel 
count and intensity for each image with Bioquant Nova Prime software 
(BIOQUANT Image Analysis Corporation), Background fluorescence, which 
10 was subtracted from experimental images, was determined by quantification of 
fluorescence images of untransfected cells at equivalent confluence, taken under 
identical illumination and exposure settings. 

RESULTS 

15 Expression of tagged TorsinA constructs. To test whether allele-specific 

silencing could be appUed to DYTl, a way to differentiate TAwt and TAmut 
proteins needed to be developed. Because TAwt and TAmut display identical 
mobility on gels and no isoform-specific antibodies are available, amino- 
terminal epitope-tagged TA constructs and GFP-TA fiision proteins were 

20 generated that would allow distinguishingTAwt and TAmut. The use of GFP-TA 
fusion proteins also faciUtated the abiUty to screen siRNA suppression because it 
allowed visualization of TA levels in living cells over time. 

In transfected Cos-7 cells, epitope-tagged TA and GFP-TA fiision protem 
expression was confirmed by using the q)propriate anti-epitope and anti-TA 

25 antibodies. Fluorescence microscopy in living cells showed that GFP-TAwt and 
GFP-TAmut fiision proteins were expressed diffusely in the cell, primarily in the 
cytoplasm, although perinuclear inclusions were also seen. It is important to note 
that these constmct were designed to express reporter proteins in order to assess 
allele-specific RNA interference rather than to study TA fimctioiL The N- 

30 terminal epitope and GFP domains likely disrupt the normal signal peptide- 
mediated translocation of TA mto the lumen of the endoplasmic reticulum, 
where TA is thought to fiinction. Thus, while these constructs facilitated 



82 



wo 2004/013280 



PCT/US2003/016887 



expression analysis in the studies described here, they are of limited utility for 
studying TA function. 

SOendng TorsuiA wUh siRNA. Various siRNAs were designed to test 
the hypothesis that siRNA-mediated suppression of TA expression could be 
5 achieved in an allele-specific manner (figure 12). Because siRNA can display 
exquisite sequence specificity, flie three base pair difiference between mutant and 
wild type TORI A alleles migjit be sufficient to permit the design of siRNA that 
preferentially recognizes mRNA derived from the mutant allele. Two siRNAs 
were initially designed to target TAmut (mutA-siRNA and mulB-siRNA) and 

10 one to target TAwt (wt-siRNA). In addition, a positive control siRNA was 

designed to silence both alleles (com-siRNA) and a negative control siRNA of 
irrelevant sequence (mis-siRNA) was designed. Cos-7 cells were first 
cotransfected with siRNA and plasmids encoding either GFP-TAwt or untagged 
TAwt at a siRNA to plasmid ratio of 5 : 1 . With wt-siRNA, potent silencing of 

15 TAwt expression was observed to less than 1 % of control levels, based on 
western blot analysis of cell lysates (Figures 13A and 13C). With com-siRNA, 
TAwt expression was suppressed to --30 % of control levels. In contrast, mutA- 
siRNA did not suppress TAwt and mutB-siRNA suppressed TAwt expression 
only modestly. These results demonstrate robust suppression of TAwt expression 

20 by wild type-specific siRNA but not mutant-specific siRNA. 

To assess suppression of TAmut, the same siRNAs were cotransfected 
with plasmids encoding untagged or HA-tagged TAmut. With mutA-siRNA or 
mutB-siRNA, marked, though somewhat variable, suppression of TAmut 
expression was observed as assessed by westem blot analysis of protein levels 

25 (Figure 13B and 13C). With com-siRNA, suppression of TAmut expression was 
observed similar to what was observed with TAwt expression. In contrast, wt- 
siRNA did not suppress expression of TAmut. Thus differential suppression of 
TAmut expression was observed by allele-specific siRNA in precisely the 
manner anticipated by the inventors. 

30 To achieve even more robust silencing of TAmut, a third siRNA was 

engmeered to target TAmut (mutC-siRNA, Figure 12). MutC-siRNA places the 
GAG deletion more centrally in the siRNA duplex. Because the central portion 
of the antisense strand of siRNA guides mRNA cleavage, it was reasoned that 
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placing the GAG deletion more centrally might enhance specific suppression of 
TAmut. As shown in Figure 13, mutC-slRNA siqipressed TAmut expression 
more specifically and rohustly than the other mut-siRNAs tested. In transfected 
cells, mutC-siENA suppressed TAmut to less than 0.5% of control levels, and 
5 had no effect on the expression of TAwt. 

To confinn allele-specific suppression by wt-siRNA and mutC-siRNA, 
respectively, the inventors cotransfected cells with GFP-TAwt or GFP-TAmut 
together with mis-siRNA, wt-siRNA or mutC-siRNA. Levels of TA expression 
were assessed 24 and 48 hours later by GFP fluorescence, and quantified the 

10 fluorescence signal firom multiple images was quantified. The results (Figure 
13D and 13E) confirmed the earlier western blots results in showing potent, 
specific silencing of TAwt and TAmut by wt-siRNA and mutC-siRNA, 
respectively, in cultured mammalian cells. 

AUde-specific silencing in simulated heterozygous state. In DYTl, both 

15 the mutant and wild type alleles are expressed. Once the efBcacy of siRNA 

silencmg was established, the inventors sought to confinn siRNA specificity for 
the targeted allele in cells that mimic the heterozygous state of DYTl. In plants 
and Caenorhabditis elegans, RNA-dependent RNA polymerase activity primed 
by introduction of exogenous RNA can result in the spread of silencing signals 

20 along the entire length of the targeted mRNA (Fire 1998, Tang 2003). No 
evidence for such a mechanism has been discovered in mammalian cells 
(Schwaiz 2002, Chiu 2002). Nonetheless it remained possible that silencing of 
the mutant allele might activate cellular processes that would also inhibit 
expression from flie wild type allele. To address this possibility, Cos-7 cells were 

25 cotransfected with both GFP-TAwt and HA-TAmut, and suppression by mis- 
siRNA, wt-siRNA or mutC-siRNA was assessed. As shown in Figure 14, potent 
and specific silencing of the targeted allele (either TAmut or TAwt) to levels less 
than 1% of controls was observed, with only slight suppression in the levels of 
the non-targeted protein. Thus, in cells expressing mutant and wild type forms of 

30 the protein, siRNA can suppress TAmut while sparing expression of TAwt 



84 



wo 2004/013280 PCTAJS2003/016887 
DISCUSSION 

In this study the inventors succeeded in generating siRNA that 
specifically and robustly suppresses mutant TA, the defective protein responsible 
for the most common form of primary generalized dystonia. The results have 
5 several implications for the treatment of DYTl dystonia. First and foremost, the 
siqjpression achieved was remarkably allele-specific, even in cells simulating the 
heterozygous state. In other words, efiBcient suppression of mutant TA occuired 
without significant reduction in wild type TA. Homozygous TA knockout mice 
die shortly after birth, while the heterozygous mice are normal (Goodchild 2002) 
10 , suggesting an essential fimction for TA. Thus, therapy for DYTl needs to 
elimmate the dominant negative or dominant toxic properties of the mutant 
protein while sustaining expression of the normal allele in order to prevent the 
deleterious consequences of loss of TA fimction. Selective siRNA-mediated 
suppression of the mutant allele fulfills these criteria without requiring detailed 
1 5 knowledge of the pathogenic mechanism. 

An pealing feature of the present siRNA therapy is applicable to all 
individuals afiOicted with DYTl. Except for one unusual case (Leung 2001, 
Doheny 2002, Klem 2002b), all persons with DYTl have the same (GAG) 
deletion mutation (Ozehus 1997, OzeUus 1999). This obviates the need to design 
20 individually tailored siRNAs. In addition, the fact that the DYTl mutation 

results m a fiiU three base pair difference from the wild type allele suggests that 
siRNA easily distinguishes mRNA derived firom normal and mutant TORI A 
alleles. 

It is important to recognize that DYTl is not a fiilly penetrant disease 
25 (Fahn 1 998, Klein 2002a) . Even when expressed maximally, mutant TA causes 
significant neurological dysfimction less than 50% of the time. Thus, even partial 
reduction of mutant TA levels might be sufficient to lower its pathological bram 
activity below a clinically detectable threshold. In addition, the DYTl mutation 
ahnost always manifests before age 25, suggesting that TAmut expression 
30 during a critical developmental window is required for sycaptom onset. This 
raises the possibility that suppressing TAmut expression during development 
might be sufficient to prevent symptoms throughout life. Finally, unUke many 
other inherited movement disorders DYTl is not characterized by progressive 
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neuTodegeneration, The clinical phenotype must result primarily from aeuronal 
dysfunction rather than neuronal cell death (Homjddewicz 1986, Walker 2002, 
Augood 2002, Augood 1999). This suggests the potential reversibility of DYTl 
by suppressing TAmut expression in overtly symptomatic persons. 

5 

Example 4 
siRNA Specific for Hantingtin's Disease 
The present inventors have developed huntingtin siKNA focused on two targets. 
One is non-allele specific (siHDexon2), flie olhcr is targeted to the exon 58 

10 codon deletion, the only known common intragenic polymorphism in linkage 
dysequilibiram with the disease mutation (Ambrose et al, 1994). Specifically, 
92% of wild type huntmgtin alleles have four GAGs in exon 58, while 38% of 
HD patients have 3 GAGs in exon 58. To assess a siRNA targeted to the 
intragenic polymorphism, PC6-3 cells were transfected with a fiill-length 

15 huntingtin containmg the exon 58 deletion. Specifically, PC6-3 rat 

pheochromocytoma cells were co-transfected with CMV-human Htt (37Qs) and 
U6 siRNA hairpin plasmids. Cell extracts were harvested 24 hours latCT and 
western blots were performed using 1 5 jig total protein extract. Primary 
antibody was an anti-huntingtin monoclonal antibody (MAB21 66, Chemicon) 

20 that reacts with himian, monkey, rat and mouse Htt proteins. 

As seen in Figure 15, the siRNA lead to silencing of the disease allele. 
As a positive control, a npn-allele specific siRNA targeted to exon 2 of the 
huntingtin gene was used. siRNA directed against GEP was used as a negative 
control. Note that only si£x58# 2 is fimctional. 

25 

All publications, patents and patent applications ^e incorporated herein- 
by reference. While in the foregoing specification this invention has been 
described in relation to certain preferred embodiments thereof, and many details 
have been set forth for purposes of illustration, it will be apparent to those skilled 
30 in the art that the invention is susceptible to additional embodiments and that 
certain of the details described herein naay be varied considerably without 
departing from the basic principles of the invration. 
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WHAT IS CLAIMED IS: 

1. A mammalian cell comprising an isolated first strand of RNA of 15 to 30 
nucleotides in length, and an isolated second strand of RNA of 15 to 30 
nucleotides in length, wherein the first strand comprises a sequence that 
is complementary to at least 15 contiguous nucleotides of a targeted gene 
of interest, wherein at least 12 nucleotides of the first and second strands 
are complementary to each other and form a small interfering RNA 
(siRNA) duplex under physiological conditions, and wherein flie siRNA 
silences only one allele of the targeted gene in the cell. 

2. The mammaUan ceU of claim 1, wherein the duplex is between 15 and 25 
base pairs in length. 



3. The mammaUan cell of any of the preceding claims, wherein the first 
15 and/or second strand fiirther comprise an overhang region. 

4. The mammahan cell of any of the preceding claims, wherein the first 
and/or second strand further comprises a 3' ovwhang region, a 5' 
overhang region, or both 3' and 5" overhang regions. 

20 

5. The mammalian cell of claims 3 or 4, wherein the overhang region is 
fix>m 1 to 10 nucleotides in length. 

6. The mammaUan cell of any of the preceding claims, wherein the first 
25 strand and the second strand are oporably linked by means of an RNA 

loop strand to form a hairpin structure comprising a di^plex structure and 
a loop structure. 

7. The mammalian cell of claim 6, wherein the loop structure contains from 
30 4 to 10 nucleotides. 



8. 



The mammalian cell of claim 6, whwein the loop structure contains 4, 5 
or 6 nucleotides. 
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9. A mammalian cell conqjrising an expression cassette encoding an 
isolated first strand of RNA of 15 to 30 nucleotides in length, and an 
isolated second strand of RNA of 15 to 30 nucleotides in length, wherein 

5 the first strand comprises a sequence that is complementary to at least 15 

contiguous nucleotides of a targeted gene of interest, wherein at least 12 
nucleotides of the first and second strands are complementary to each 
other and form a small mterfering RNA (siRNA) duplex under 
physiological conditions, and wherein the siRNA silences only one allele 
10 of the targeted gene in the cell. 

1 0. The mammalian cell of claim 9, wherein the expression cassette further 
comprises a promoter. 

15 11. The mammalian cell of claim 10, wherein the promoter is a regulatable 
promoter. 

12. The mammaUan cell of claim 10, wherein the promoter is a constitutive 
promoter. 

20 

13. The mammalian cell of claim 10, wherein the promoter is a CMV, RSV, 
pol H or pol in promoter. 

14. The mammalian cell of claim 9, wherein the expression cassette further 
25 comprises a polyadenylation signal. 

15. The mammalian cell of claim 14, wherein the polyadenylation signal is a 
synthetic minimal polyadenylation signal. 

30 16. The manunalian cell of claim 9, fiirthCT comprising a marker gene. 



17. 



The manmialian cell of any of claims 9-16, wherein the expression 
cassette is contained in a vector. 
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18. The mammalian ceU of any of claims 9-17, wherein the vector is an 
adenoviral, lentiviral, adeno-associated viral (AAV), poliovirus, HSV, or 
murine Maloney-based viral vector. 

19. The mammalian cell of any of the preceding claims, wherein the vector is 
an adenoviral vector. 



The mammaUan ceU of any of the preceding claims, wherein the targeted 
gene is a gene associated with a condition amenable to siRNA therapy. 

The mammalian cell of any of the preceding claims, wherein aUeles of 
the gene differ by seven or fewer base paira out of 21 base paira. 

The mammalian cell of any of the preceding claims, wherein the gene is 
a beta-glucuronidase gene. 



23. The mammaUan cell of any of the preceding claims, wherein the alleles 
are muxine-specific and human-specific alleles of beta-glucuronidse. 

20 

24. The mammalian cell of any of the preceding claims, wherein alleles of 
the gene differ by one base pair out of 21 base pairs. 

25. The mammalian cell of claim 24, wherein the gene encodes a transcript 
25 for TorsinA, Ataxin-3, Tau or huntingtin. 

26. An isolated RNA duplex comprising a first strand of RNA and a second 
strand of RNA, wherein the first strand comprises at least 15 contiguous 
nucleotides complementary to mutant TorsinA transcript encoded by 

30 SEQ E) NO:55, and wherein the second strand is complementary to at 

least 12 contiguous nucleotides of the first strand. 
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27. The RNA duplex of claim 26, wherein the first strand of RNA is encoded 
by SEQ E) NO:49 (mutA-si). 

28. The RNA duplex of claim 26, wherein the second strand of RNA is 
5 encoded by SEQ ID NO:50 (mutA-si). 

29. The RNA duplex of claim 26, wherein the first strand of RNA is encoded 
by SEQ ID N0:51 (mutB-si). 

10 30. The RNA duplex of claim 26, wherein the second strand of RNA is 
encoded by SEQ ID NO:52 (mutB-si). 



3 1 . The RNA duplex of claim 26, wherein the first strand of RNA is encoded 
by SEQ ID NO:53 (muC-si). 

15 

32. The RNA duplex of claim 26, wherein the second strand of RNA is 
encoded by SEQ ID NO:54 (mutC-si). 

33. An RNA duplex comprising a first strand of RNA and a second strand of 
20 RNA, wherein the first strand comprises at least 15 contiguous 

nucleotides complementary to mutant Ataxin-3 transcript encoded by 
SEQ ID N0:8, and wherein the second strand is complementary to at 
least 12 contiguous nucleotides of the first strand. 

25 34. The RNA duplex of claim 33, wherein the first strand of RNA is encoded 
bySEQIDNO:19(siC7/8). 

35. The RNA duplex of claim 33, wherein the second strand of RNA is 
encoded by SEQ ID NO: 20 (siC7/8). 

30 

36. The RNA duplex of claim 33, wherein the first strand of RNA is encoded 
bySEQIDNO:21(siC10). 
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37. The RNA duplex of claim 33, wherein the second strand of RNA is 
encoded by SEQ ID NO:22 (siClO). 

38. An RNA duplex comprising a first strand of RNA and a second strand of 
5 RNA, wherein the first strand comprises at least 1 5 contiguous 

nucleotides complementary to mutant Tau transcript encoded by SEQ ID 
NO:39 (siA9/C12), and wherein the second strand is complementary to at 
least 12 contiguous nucleotides of the Gist strand. 

10 39. The RNA duplex of claim 38, wherein the second strand of RNA is 
encoded by SEQ ID NO:40 (siA9/C12). 

The RNA duplex of any of claims 26-39, wherein the duplex is between 
15 and 30 base pairs in length. 

The RNA duplex of any of claims 26-40, wherein the dyjplex is between 
1 9 and 25 base pairs in length. 

The RNA duplex of any of claims 26-41, wherem the fiist and/or second 
strand further comprises an overhang region. 

The RNA duplex of any of claims 26-42, wherein the first and/or second 
strand further comprises a 3' overhang region, a 5' overhang region, or 
both 3' and 5' overhang regions. 

The RNA duplex of claim 42, wherein the overhang region is from 1 to 
10 nucleotides in length. 

The RNA duplex of any of claims 26-44, wherein the first strand and the 
second strand are operably linked by means of an RNA loop strand to 
form a hairpin structure comprising a duplex structure and a loop 
structure. 
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46. The RNA duplex of claim 45, wherein the loop structure contains from 4 
to 10 nucleotides. 

47. The RNA duplex of claim 45, wherem the loop structure contains 4, 5 or 
5 6 nucleotides. 

48. An expression cassette comprising a nucleic acid encoding at least one 
strand of the RNA duplex of any of claims 26-47. 

10 49. The expression cassette of claim 48, further comprising a promoter. 

50. The expression cassette of claim 49, wherein the promoter is a 
regulatable promoter. 

15 51. The expression cassette of claim 49, wherein the promoter is a 
constitutive promoter. 

52. The expression cassette of claim 49, wherein the promoter is a CMV, 
RS V, pol n or pol m promoter. 

20 

53. The expression cassette of any of claims 48-52, wherein the expression 
cassette further comprises a polyadenylation signal. 

54. The expression cassette of claim 53, wdierem the polyadenylation signal 
25 is a synthetic miniinal polyadenylation signal. 

55. The expression cassette of any of claims 48-54, further comprising a 
mark^ gene. 

30 56. A vector comprismg the expression cassette of any of claims 48-55. 

57. A vector comprising two expression cassettes, a first expression cassette 
comprising a nucleic acid encoding the first strand of the RNA duplex of 
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any of claims 26-47 and a second expression cassette comprisrag a 
nucleic acid encoding the second strand of the RNA duplex of any of 
claims 26-47. 

5 58. A cell comprising the expression cassette of any of claims 48-55. 

59. The cell of claim 58, wherein the cell is a mammalian cell. 

60. A non-himian mammal comprising the expression cassette of any of 
10 claims 48-55. 

61. A method of performing allele-specific gene silencing in a mammal 
comprising administering to the mammal an isolated jSrst strand of RNA 
of 15 to 30 nucleotides in length, and an isolated second strand of RNA 

15 of 15 to 30 nucleotides in length, wherein the first strand comprises a 

sequence that is complementary to at least 15 contiguous nucleotides of a 
targeted gene of interest, wherein at least 12 nucleotides of the first and 
second strands are complementary to each other and fonn a small 
interfering RNA (siRNA) duplex under physiological conditions, and 

20 wherein the siRNA silences only one allele of the targeted gene in the 

mammal. 

62. The method of claim 61, wherein alleles of the gene differ by seven or 
fewer base pairs out of 21 base pairs. 

25 

63. The method of claim 61, wherein the gene is a beta-gjucuronidase gene. 

64. The method of claim 61, wherein the alleles are murine-specific and 
human-specific alleles of beta-glucuronidse. 

30 

65. The method of claim 61, wherein alleles of the gene differ by one base 
pair out of 21 base pairs. 
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66. The method of claim 65, wherein the gene encodes a transcript for 
TorsmA, Ataxin-3, Tau or huntingtin . 

67. The method of claim 61 , wherein the targeted geae is a gene associated 
5 with a condition amenable to siKNA therapy. 

68. The method of claim 67, wherein the condition amenable to siSNA 
ther^y is a neurodegenerative disease. 

10 69. The method of claim 68, wherein the neurodeg^erative disease is a 
trinucleotide-repeat disease. 

70. The method of claim 69, wherein the trinucleotide-repeat disease is a 
disease associated with polyglutamine repeats. 

15 

71 . The method of claim 70, wherein the trinucleotide-repeat disease is 
Himtington*s disease or a spinocerebellar ataxia (SCA). 

72. The method of claim 71, wherein the SCA is SCAl, SCA2, SCA3, 
20 . SCA6, SCA7, or SCA17. 



73. The method of claim 61, wherein the targeted gene encodes a ligand for a 
chemokme involved in the migration of a cancer cell, or a chemokine 

25 receptor. 

74. A method of substantially silencing a targeted allele while allowing 
substantially continued e?q}ression of a wild-type allele comprising 
contacting a cell with an expression cassette, wherein &e expression 

30 cassette comprises a nucleic acid sequence encoding a small interfering 

RNA molecule (siRNA) targeted against the targeted allele, wherein 
expression from the targeted allele is substantially silenced but wherein 
expression of the wild-type allele is not substantially silenced. 
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75. A method of treating dommantly inherited disease in an allele-specific 
manner comprising administering to a patient m need thereof an 
expression cassette, wherein the expression cassette comprises a nucleic 
5 acid sequence encoding a small interfering RNA molecule (siRNA) 

targeted against a targeted allele, wherein expression from the targeted 
allele is substantially silenced but wherein expression of the wild-type 
allele is not substantially silenced. 

10 76. A method of performing allele-specific gene silencing comprising 
administOTng an expression cassette comprising a pol n promoter 
operably-linked to at least one strand of a nucleic acid encoding a small 
interfering RNA molecule (siRNA) targeted against a gene of interest, 
wherem the siRNA silences only one allele of a gene. 

15 

77. A method of performing allele-specific gene silencing in a mammal 
comprising administering to the mammal a vector comprising an 
expression cassette, wherein the expression cassette comprises a nucleic 
acid encoding at least one strand a small interfering RNA molecule 

20 (siKNA) targeted against a gene of interest, wherein the siRNA silences 

only one allele of a gene. 

78. A method of screening of allele-specific siRNA duplexes comprising 

(a) contacting a cell containing a predetermined mutant allele with an 
25 siRNA witti a known sequence, 

(b) contacting a cell containing a wild-type allele with an siRNA with 
a known sequence, and 

(c) determining if the mutant allele is substantially silenced while the 
wild-type allele retains substantially normal activity. 



30 



79, A method of screenmg of allele-specific siRNA duplexes comprising 
(a) contacting a cell containing a predetermined mutant allele and a 
wild-type allele with an siRNA with a known sequence, and 
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(b) determining if the mutant allele is substantially silenced while the 
wild-type allele retains substantially normal activity. 

80. A method for determining the function of an allele comprising: 

5 (a) contacting a cell containing a predetermined allele with an siRNA 

with a known sequence, and 

(b) determining if the function of the allele is substantially modified. 

81. A method for determining the function of an allele comprising: 

10 (a) contacting a cell containing a predetermined mutant aUele and a 

wild-type allele with an siKNA with a known sequence, and 
(b) determining if the function of the allele is substantially modified 
while the wild-type allele retains substantially normal function. 



15 
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Primer Sequence (5' -3') 
CGGcaAQcrrGCGCMaAAGTTcfr6***006UO siw ' -Tau 

MG2UWC7rTGArGGTCSW3CTT6C(Sg OlPUGU^ 
ATGAACTTC3tf3GGTCAGCTTGC(^ *a » 0 W Rfci272 

CTGTCCTGA'EAGGTCOOGCTGC taGUlP*i6\M9) 

CAGCAGCAGGGGGftCCTATG 
CTaATAGOTOCXXJCTGCTGC L^(^ * pA>(>'*KS 

CXGTCCTGATAaaTOCGOCTGC^^t VPOCktO^ 

CAGCAGCAGOC5GGACCTATc(^6C^ llPA> 6' t|>lA9/C8 
CTGATAGaTOCCGGTGCTGC(SC;CSl\0 Ut'.t^ 



Prlzner Sequenoo (S'-3') 

CAfiCGTGATCTTCXMCACTTC Ut(^JPW6^^ 

GAGCOSGGAGTaGaGJ^ASOTGC C5>6 UU>P(i^2?$ 
CTOCACClTOCOGACrCSCCGGC IPOfi^^ 

ACGTOCTOSGCGGCGGCAGTGTGC (SEtiOPtoV^ll 

TTGGAcavcTGCOGOcrccGCJaaAC t%C%lPA>G: 

ACGTCTCCATGGCATCTGAGC 
TTGCTGAGATGCGATGGASACC^E CM\P U 

GTGGOCAGATGGAAiOTAAAArc(5£C^ l^OA)^:^'?') 
CAGATTTTACITCCATCTGGCC 

GTGGCCACATGGAAGTAAAATC C$6Cfe W?/^^*'^?} 
Ca«3MTTTACTTCCATGTGGOCtbfc^ VPjOCkVJtA 



TTGAAAAAGAGCAGCAAlJ!UGClSfr^ilV30ft23^^ GTGGCCAGATGGAAGTAAAATct^Cfe IPP«>:'3^') 

CTGCTTTTGCTGCTGTTTTTC O^Et^flPAJWH) CAGATTTTACTTGGATCTGGCCefi^^ VO»V-MO 

CAGCAGCAGCAGCAGCAGCAGCCbS \Wt)L^ 
CTGCTGCTGCTGCTGCTGCTGC fSEU |DA^6*.1^) 
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SEQUENCE LISTING 
<110> University of Iowa Research Foundation et al. 

5 

<120> Allele-Specif ic Silencing of Disease Genes 

<130> 875.084WO1 

10<150> US 10/212,322 
<151> 2002-08-05 

<150> US 10/322,086 
<151> 2002-12-17 

15 

<150> US 10/430,351 
<151> 2003-05-05 

<160> 55 

20 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 40 
25<212> DNA 

<213> Artificial Sequence 

<220> 

<223> A primer 

30 

<400> 1 

aaggtaccag atcttagtta ttaatagtaa tcaattacgg 40 

<210> 2 
35<211> 43 
<212> DNA 

<213> Artificial Sequence 

<220> 
40<223> A primer 



<400> 2 

gaatcgatgc atgcctcgag acggttcact aaaccagctc tgc 
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2 

<210> 3 
<211> 69 
<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> An oligonucleotide, used with SEQ ID N0:4, to form a minimal polyA 



<400> 3 

lOctagaactag taataaagga tcctttattt tcattggatc cgtgtgttgg ttttttgtgt 60 
gcggccgcg 69 



<210> 4 
<211> 69 
X5<212> DNA 

<213> Artificial Sequence 



<220> 

<223> An oligonucleotide, used with SEQ ID N0:3, to form a minimal polyA 

20 

<400> 4 

tcgacgcggc cgcacacaaa aaaccaacac acggatccaa tgaaaataaa ggatccttta 60 
ttactagtt 69 



25<210> 5 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
30<220> 

<223> A P32 labeled sense oligonucleotide used to probe a blot 
<400> 5 

cacaagctgg agtacaacta c 21 

35 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

40 

<220> 

<223> A P32 labeled antisense oligonucleotide used to probe a blot 
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3 

<400> 6 

gtacttgtac tccagctttg tg 

<210> 7 
5<211> 28 
<212> DNA 
<213> Homo sapiens 



<400> 7 

lOcagcagcagc agggggacct atcaggac 

<210> 8 
<211> 28 
<212> DNA 
15<213> Homo sapiens 

<400> 8 

cagcagcagc agcgggacct atcaggac 

20<210> 9 

<211> 17 
<212> DNA 

<213> Artificial Sequence 

25<220> 

<223> T7 promoter sequence 



<400> 9 

tatagtgagt cgtatta 

I 

<210> 10 
<Z11> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A primer was annealed to all oligos to synthesize siRNAs 



<400> 10 
40taatacgact cactatag 



18 
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<210> 11 

<211> 22 

<212> DNA 

<213> Homo sapiens 

5 

<400> 11 

cggcaagctg cgcatgaagt tc 22 

<210> 12 
10<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ISatgaacttca tgctcagctt gc 22 

<210> 13 
<211> 22 
. <212> DNA 
20<213> Homo sapiens 

<400> 13 

atgaacttca gggtcagctt gc 22 

25<210> 14 
<211> 22 
<212> DNA 
<213> Homo sapiens 

30<400> 14 

cggcaagctg accctgaagt tc 22 

<210> 15 
<211> 22 
35<212> DNA 

<213> Homo sapiens 



<400> 15 

cagcagcggg acctatcagg ac 

40 



22 
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<210> 16 
<211> 22 

<212> DNA 

<213> Homo sapiens 

5 

<400> 16 

ctgtcctgat aggtcccgct gc 22 

<210> 17 
10<211> 20 
<212> DNA 
<213> Homo sapiens 

<400> 17 

IScagcagcagg gggacctatc 20 

<210> 18 
<211> 20 
<212> DNA 
20<213> Homo sapiens 

<400> 18 

ctgataggtc cccctgctgc 20 

25<210> 19 
<211> 22 
<212> DNA 
<213> Homo sapiens 

30<400> 19 

cagcagccgg acctatcagg ac 22 

<210> 20 
<211> 22 
35<212> DNA 

<213> Homo sapiens 



<400> 20 

ctgtcctgat aggtccggct gc 

40 



22 
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<210> 21 
<211> 20 

<212> DNA 

<213> Homo sapiens 

5 

<400> 21 

cagcagcagc gggacctatc 

<210> 22 

10<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 22 
isctgataggtc ccgctgctgc 

<210> 23 
<211> 21 
<212> DNA 
20<213> Homo sapiens 

<400> 23 

ttgaaaaaca gcagcaaaag c 

25<210> 24 
<211> 21 
<212> DNA 
<213> Homo sapiens 

30<400> 24 

ctgcttttgc tgctgttttt c 

<210> 25 
<211> 22 
35<212> DNA 

<213> Homo sapiens 

<400> 25 

cagcagcagc agcagcagca gc 

40 
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20 



21 



21 
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<210> 26 

<211> 22 
<212> DNA 

<213> Homo sapiens 

5 

<400> 26 

ctgctgctgc tgctgctgct gc 

<210> 27 
10<211> 22 
<212> DNA 

<213> Homo sapiens 

<400> 27 
15'tcgaagtgat ggaagatcac gc 

<210> 28 
<211> 22 
<2X2> DNA 
20<213> Homo sapiens 

<400> 28 

cagcgtgatc ttccatcact tc 

25<210> 29 ' 
<211> 22 
<212> DNA 

<213> Homo sapiens 

30<400> 29 

cagccgggag tcgggaaggt gc 

<210> 30 
<211> 22 
35<212> DNA 

<213> Homo sapiens 

<400> 30 

ctgcaccttc ccgactcccg gc 

40 



wo 2004/013280 



<210> 31 

<211> 24 

<212> DNA 

<213> Homo sapiens 

5 

<400> 31 

acgtcctcgg cggcggcagt gtgc 

<210> 32 
10<211> 24 
<212> DNA 

<213> Homo sapiens 

<400> 32 
ISttgcacactg ccgcctccgc ggac 

<210> 33 

<211> 21 

<212> DNA 

20<213> Homo sapiens 

<400> 33 

acgtctccat ggcatctcag c 

25<210> 34 

<211> 21 

<212> DNA 

<213> Homo sapiens 

30<400> '34 

ttgctgagat gccatggaga c 

<210> 35 

<211> 22 

35<212> DNA 

<213> Homo sapiens 

<400> 35 

gtggccagat ggaagtaaaa tc 

40 



PCT/US2003/016887 

8 



24 



24 



21 



21 
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<210> 36 

<211> 22 

<212> DNA 

<213> Homo sapiens 

5 

<400> 36 

cagattttac ttccatctgg cc 

<210> 37 
10<211> 22 
<212> DNA 
<213> Homo sapiens 

<400> 37 
ISgtggccacat ggaagtaaaa tc 

<210> 38 
<211> 22 
<212> DNA 
20<213> Homo sapiens 

<400> 38 

cagattttac ttccatgtgg cc 

25<210> 39 
<211> 22 
<212> DNA 
<213> Homo sapiens 

30<400> 39 

gtggccagat gcaagtaaaa tc 

<210> 40 
<211> 22 
35<212> DNA 

<213> Homo sapiens 



9 



22 



22 



22 



22 



<400> 40 

cagattttac ttgcatctgg cc 

40 



22 
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<210> 41 
<211> 22 

<212> DNA 

<213> Homo sapiens 

5 

<400> 41 

gtggccaggt ggaagtaaaa tc 

<210> 42 
10<211> 22 
<212> DNA 

<213> Homo sapiens 

<400> 42 
ISatgaacttca tgctcagctt go 

<210> 43 
<211> 22 
<212> DNA 
20<213> Homo sapiens 

<400> 43 

cggcaagctg agcatgaagt tc 

25<210> 44 
<211> 22 
<212> DNA 
<213> Homo sapiens 

30<400> 44 

cagtggcttc tggcacagca gc 

<210> 45 
<211> 22 
35<212> DNA 

<213> Homo sapiens 

<400> 45 

aagctgctgt gccagaagcc ac 

40 



22 



22 



22 



22 
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11 

<210> 46 
<211> 42 
<212> DNA 

<213> Homo sapiens 

5 

<400> 46 

gtaagcagag tggctgagga gatgacattt ttccccaaag ag 42 

<210> 47 
10<211> 21 
<212> DNA 
<213> Homo sapiens 

<400> 47 

IScagagtggct gaggagatga c 21 

<210> 4B 
<211>* 21 
<212> DNA 
20<213> Homo sapiens 

<400> 48 

gtgtcatctc ctcagccact c 21 

25<210> 49 
<211> 18 
<212> DNA 
<213> Homo sapiens 

30<400> 49 

cagagtggct gagatgac 18 

<210> 50 
<211> 18 
35<212> DNA 

<213> Homo sapiens 



<400> 50 

atgtcatctc agccactc 



18 
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12 

<210> 51 

<211> 20 

<212> DNA 

<213> Homo sapiens 

5 

<400> 51 

ctgagatgac atttttcccc 

<210> 52 
10<211> 20 
<212> DMA 
<213> Homo sapiens 

<400> 52 
ISttggggaaaa atgtcatctc 

<210> 53 
<211> 23 
<212> DNA 
20<213> Homo sapiens 

<4Q0> 53 

gagtggctga gatgacattt tfcc 

25<210> 54 
<211> 23 
<212> DNA 
<213> Homo sapiens 

30<400> 54 

gggaaaaatg tcatctcagc cac 

<210> 55 
<211> 39 
35<212> DNA 

<213> Homo sapiens 

<400> 55 * 
gtaagcagag tggctgagat gacatttttc cccaaagag 

40 
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20 



20 



23 



23 



